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Brief Reports 


The Journal of Consulting Psychology will 
accept Brief Reports of research studies in 
clinical psychology for early publication with- 
out expense to the author. The procedure is 
intended to permit the publication of soundly 
designed studies of specialized interest or lim- 
ited importance which cannot now be ac- 
cepted because of lack of space. Several pages 
in each issue will be devoted to Brief Reports, 
published in the order of their receipt with- 
out respect to the dates of receipt of the regu- 
lar articles. Most Brief Reports appear in the 
first or second issue to go to press following 
their final acceptance. 


An author who wishes to submit a Brief 
Report: 


1. Sends the Brief Report, limited to one printed 
page and prepared according to the specifications 
given below. 

2. Also sends to the Editor a full report of the re- 
search study, in sufficient detail to give a clear ac- 
count of its background, procedure, results, and con- 
clusions, which will be filed with the American 
Documentation Institute to insure indefinite avail- 
ability. 

3. Prepares at least 100 mimeographed copies of 
the full report, which the author will send without 
charge to all who request it as long as the supply 
lasts. 

4. Agrees not to submit the full report to another 
journal of general circulation. 


Specifications 
Brief Report. The Brief Report should give 
a clear, condensed summary of the procedure 
of the study and as full an account of the re- 
sults as space permits. 
To insure that the Brief Report will be no 
longer than one printed page, its typescript, 


including all matter except the title and the 
author’s lines, must not exceed 85 lines av- 
eraging 42 characters and spaces in length. 
Set the typewriter margins for short lines of 
42 characters, which are 3.5 inches long in 
elite typing, and 4.2 inches long in pica. 

The manuscript of the Brief Report must 
be double spaced throughout. Except for its 
short lines, it follows the standard style of 
the 1957 revision of the APA Publication 
Manual. Headings, tables, and references are 
avoided or, if essential, must be counted in 
the 85 lines. Each Brief Report must be ac- 
companied by a footnote in the style below, 
which is typed on a separate sheet and not 
counted in the 85-line quota: 


1An extended report of this study may be ob- 
tained without charge from John Doe, 300 Market 
St., Prospect 6, Mass. (giving the author’s full name 
and address), or for a fee from the American Docu- 
mentation Institute. Order Document No. , re- 
mitting $—— for microfilm or $ for photo- 
copies. 


Extended report. Because the extended re- 
port is intended for photoduplication, and is 
not copy to be sent to a printer, its style 
should differ in several ways from that of 
other manuscripts: (a) The extended report 
should be typed with single spacing for 
economy in duplication. (6) Tables and fig- 
ures should be placed adjacent to the text 
which refers to them. A caption should be 
typed below each figure. (c) Footnotes should 
be typed at the bottom of the page on which 
reference is made to them. In other respects, 
the full report is prepared in the style speci- 
fied by the Publication Manual. 
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PSYCHOTHERAPISTS’ APPROACH-AVOIDANCE 
REACTIONS TO PATIENTS’ EXPRESSIONS 
OF HOSTILITY 


ALBERT BANDURA, DAVID H. LIPSHER, ano PAULA E. MILLER 


Stanford University 


In a previous study (Bandura, 1956), an 
inverse relationship was found between thera- 
pists’ anxiety level and ratings of their psy- 
chotherapeutic competence. The purpose of 
the present investigation was to determine, 
through a response by response analysis of 


actual patient-therapist interactions, the spe- 
cific ways in which therapists’ anxieties may 
affect their psyc! work. Since 
hostility conflicts tend to be present to some 
degree in all patient 


othe rapeutic 


it was decided to single 
out for study the patient-therapist interac- 
tion around the expression of hostility. 
Although it to specify 
the conditions essential for effecting certain 
changes in a patient’s behavior, whether or 
not a particular therapist can produce these 
conditions may be partly determined by his 
own personality characteristics. A minimal 
condition for the resolution of a patient's 
conflicts would seem to be that the patient's 
conflictive feelings are permitted to 
within the therapy situation. If, 


may be possible 


occur 
however, 
certain Classes of expression are anxiety pro- 
voking for the therapist, he is less likely to 
permit or encourage the patient to express 
himself in these ways. Moreover, when the 
patient express that are 
threatening to the herapist, the anxieties he 
elicits often motivate the therapist to avoid 
a continuation of the anxiety producing in- 
teraction (Cutler, 1958; Little, 1951; Lubor- 
sky, 1952; Reich, 1951; Rigler, 1956). In 
line with this reasoning, it was predicted that 
therapists who display high hostility anxiety 
would be more likely to respond to patients’ 
hostility with avoidance reactions and less 
likely to respond with approach reactions than 
would therapists who show low hostility 
anxiety. 


does tendencies 


Hypotheses were also advanced concerning 
the effect of the therapists’ approach and 
avoidance reactions on the patients’ behavior. 
It was assumed that the therapists’ approach 
reactions would constitute positive reinforce- 
ments for the patients and that such reac- 
tions would therefore serve to encourage the 
patients to express feelings of hostility. It was 
further assumed that the therapists’ 
ance reactions would serve either as nonre- 
wards or as negative reinforcements and 
would have the effect of decreasing or in- 
hibiting the expression of hostile feelings. It 
seemed possible in some instances, particu- 
larly where the patients are strongly insti- 
gated to hostility, that the therapists’ nega- 
tive responses would lead the patients to 
displace the hostility to some other object 
rather than to discontinue all expressions of 
hostility. It was therefore predicted that if 
the patients were to continue in the hostility 
topic following the therapists’ avoidance re- 
sponses they would change the object of their 
hostility more often than they would follow- 
ing therapists’ approach responses. 


avoid- 


METHOD 


Subjects. The tape recorded interviews of 17 par- 


ents who were undergoing psychotherapy at a parent - 
child clinic provided the data for this study. Twelve 
different therapists, 2 females and 10 males, were 
represented. All were advanced clinical psychology 
students, and all except 2 were undergoing individual 
psychotherapy at the time of the study. 

Therapists’ personality characteristics. The thera- 
pists’ personality characteristics were assessed through 
ratings provided by four clinical psychology staff 
members who had extensive contact, both socially 
and professionally, with the student therapists. Each 
therapist was rated independently by three staff 
members on eight 5-point rating scales. 

Hostility anxiety, which was central to this study, 
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was assessed in three ways. In the first place, it was 
assumed that with high anxiety there would be a 
greater amount of indirect than of direct hostility so 
that, if hostility were expressed at all, its forms 
would more likely be indirect. Ratings were conse- 
quently made of the amount of direct and of the 
amount of indirect hostility that the therapists typi- 
cally displayed in their interactions with others. The 
mere absence of hostility, whether of a direct or in 
direct form, would not in itself necessarily indicate 
the presence of anxiety. For instance, one might ex- 
pect a person to show little hostility if he has ex- 
perienced little frustration or instigation to hostility. 
On the other hand, if a person strongly inhibits any 
expression of hostility in the face of strong instiga- 
tion, the presence of anxiety would seem to be in- 
dicated. Therefore, ratings were also made of the 
degree to which the therapists inhibited any expres- 
sion of hostility. The hostility inhibition scale is 
given below. 

This scale deals with the readiness with which the 
S gets irritated, annoyed, angered when frustrated, 
provoked or thwarted, eg., when delayed, incon- 
venienced, criticized, opposed, etc. 


1. Very easily irritated and angered—expresses irri- 
tation, annoyance, anger at the slightest frus- 
tration 

2. Easily irritated and angered—expresses irrita- 
tion, annoyance, anger even when mildly frus- 
trated, provoked, or thwarted 

3. Expresses irritation, annoyance, anger when 
moderately frustrated, provoked, or thwarted 

4. Difficult to arouse to anger or to irritate—ex- 

presses irritation, annoyance, anger only when 
strongly frustrated, provoked, or thwarted 

. Very difficult to arouse to anger or to irritate— 

practically never expresses irritation, annoyance, 
or anger when strongly frustrated, provoked, or 
thwarted. 


The presence of hostility anxiety was thus inferred 
from the therapist’s tendency to inhibit hostile reac- 
tions in the face of high instigation and from his 
tendency te respond with predominantly indirect 
forms of hostility. 

In addition to the three measures of hostility, the 
therapists were also rated on several scales measur- 
ing dependency behavior: amount of help seeking, 
amount of approval seeking, amount of dependency 
inhibition; they were also rated on a scale measuring 
the extent of sex inhibition; and on a scale measur- 
ing warmth. These latter measures were not used as 
a basis for any predictions. 

Procedure for rating patient-therapist interaction. 
From the total number of interviews available for 
the 17 patients, 110 interviews (approximately two 
thirds of the total) were randomly selected for analy- 
sis. Coding of these interviews was done by two 
judges who worked independently on different inter- 
views. However, in order to provide an estimate of 
the reliability of the judges’ ratings, a sample of 20 
interviews, randomly selected from the pool of 110, 
was independently coded by both judges. In the case 
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of these 20 interviews, the mean coding was used in 
the further analyses. 

In order to insure independence of the coding, the 
judges had no knowledge of the ratings of the thera- 
pists’ personality characteristics. 

Scoring unit. The unit scored was an interaction 
sequence beginning with a patient statement, the 
therapist’s response, and the immediately following 
patient response. An example is given below: 


P.: I had more work to do than I felt I should 
do, that I had to do, work which I felt was unim- 
portant, wasn’t necessary to be done. I was told 
to do it. I didn’t think it was right or necessary 
at all. 

T.: How do you feel when you have to do some- 
thing he wants you to do? 

P.: Well, I don’t like it but I do it. 


Patient response category. The patient category 
scored was hostility which was defined as any ex- 
pression of dislike, resentment, anger, antagonism, 
opposition, or of critical attitudes. In order to study 
changes in the object toward whom the patient di 
rected hostility, this category was further divided 
into six subcategories according to the referent: (a) 
spouse, (b) children, (c) parents, (d) self, (e) thera- 
pist, and (f) other persons or objects. 

Therapist response categories. The therapists’ re 
sponses were divided into two general classes: ap- 
proach and avoidance reactions. 

Approach reactions included verbal responses that 
were primarily designed to elicit from the patient 
further expressions of hostile feelings, attitudes, and 
behavior. The following classes of responses were in- 
cluded in the approach category: 


a. Approval. Therapist sanctions and expresses ex 
plicit agreement with the patient’s hostile feelings 
or behavior. 

P.: I don’t know but I got so mad when hi 
came home. 

T.: Under the circumstances, how could you 
have felt otherwise ? 
b. Exploration. Therapist asks for further clarifi- 
cation, elaboration, and detailing of the patient's 
hostile feelings or behavior. 


P.: For some reason I had a bad day, just 
couldn't fall asleep. I just felt aggravated. 

T.: Aggravated? Can you tell me a little about 
that? 
c. Instigation. Therapist shifts the discussion from 
a nonhostility topic to the hostility topic or rein- 
troduces the hostility expressed by the patient ear- 
lier in the interview or in previous interviews. 

P.: I was pretty fatigued when I got home. I 
went to bed right off the bat but I just tossed and 
turned. When I woke up in the morning I had this 
pain in the leg. I went to the doctor and he gave 
me a shot. 

T.: Yes. Let’s get back again to that evening, 
that irritated feeling you had. 
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d. Reflection. Therapist repeats or restates the pa- 
tient’s hostile feelings 

P.: And when the kids don’t listen to me it rubs 
me the wrong way. I lose my temper. 

T.: You get mad. 
e. Labeling. Therapist names the patient’s feelings, 
attitudes or behavior as hostile; points out pat- 
terns in the patient's feelings or behavior; suggests 
relationships between present hostile feelings and 
behavior and past experiences 


P.: I’ve picked up the tabs on that girl all my 
life. Always paying, always costing money. I think 
I resented her a good deal 

T.: Maybe some of your resentment for Joyce is 
partly displaced from your mother in that your 
mother shipped her to you to take care of. You 
were mad at your mother as well as Joyce. 


Avoidance reactions included those verbal responses 
designed to inhibit, discourage, or divert the pa 
tients’ hostile expressions. The following classes of 
responses were included in the avoidance category: 


a. Disapproval 
having felt or 
manner. 

P.: So I blew my top and hit her. 

T.: Just for that you hit her? 
b. Topical transition. Therapist changes the dis- 
cussion from the hostility topic to a nonhostility 
topic. 


Therapist is critical of the patient’s 
behaved in a hostile, aggressive 


P.: My mother annoys me. 
T.: How old is your mother? 


c. Silence. Therapist makes no verbal response for 
4 sec. or more after the patient has expressed hos 
tility. 


P.: I just dislike it at home so much 

T.: [Silence] 

P.: So I just don’t know what to do. 
d. Ignoring. Therapist responds to the content of 
the patient’s response but ignores the hostile affect 


t times 


P.: I lose my temper over his tardiness. 
T.: What are the results of his being tardy? 


e. Mislabeling. Therapist labels as nonhostile feel- 
ings that are clearly hostile 


P.: When are you going to give me the results 
of those tests? I think I’m entitled to know. 
T.: You seem to be almost afraid to find out. 


Unclassified responses included unscorable utter- 
ances and responses that were irrelevant to the above 
subcategories 

In coding ti 


interviews, the record was played 
aud each patient ‘tatement was judged as either hos 
tile or nonhostile. If a statement was rated as hos 
tile, both the therapist response and the immediately 
succeeding patient response were rated. The object of 
the patient’s hostility was also noted 

It would have been desirable to have used sepa- 
rate sets of judges for rating the therapist and pa- 
tient responses respectively. This could have been ac 
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TABLE 1 


INTERJUDGE AGREEMENT IN SCORING RESPONSE UNiTs 


Number 
of Units 
Perfect agreement 261 
Single discrepancy 
Therapist’s response 72 
Hostility following the therapist’s response 16 
Object of hostility 12 
Two discrepancies 
Therapist’s response and object 5 
Therapist’s response and hostility 7 
Object and hostility 1 
Two discre pane ies on ob je ct of hostility 2 
Unit scored by one judge only 81 


complished only through the use of typescripts from 
which therapist or patient statements had been de- 
leted (Dittman, 1952). On the other hand, direct re- 
cordings had certain advantages. In many instances 
a patient’s hostility is communicated more through 
voice cues than through content cues and had the 
ratings been made from typescripts some of the more 
subtle hostile responses would have been lost. As a 
partial control, however, the judge rated the thera- 
pist response before rating the immediate response it 
elicited in the patient 

The scores yielded by the content analysis were 
the number of interaction sequences in which thera- 
pists responded with approach or avoidance reactions 
to the patients’ hostility and the number of sequences 
in which hostility followed therapists’ approach and 
avoidance, respectively. The hourly totals were then 
summed for each therapy case. 


RESULTS 
Interjudge Reliability 


Since the unit of analysis is an interaction 
sequence, the most relevant index of reliabil- 
ity is the degree of agreement between the 
judges in coding the response units. There are 
a number of ways in which discrepancies can 
occur. The judges may disagree as to whether 
or not the patient is expressing hostility and/ 
or disagree as to the object toward whom the 
hostility is directed. They may differ in scor 
ing the therapist’s response. Finally, they may 
disagree on whether or not hostility is ex- 
pressed in the immediately following patient 
response and/or in the object of the patient's 
hostility. 
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TABLE 2 


FREQUENCY OF Speciric INTERJUDGE DISCREPANCIES 


Number 
of Units 


Therapists’ responses 


Reflection—labeling 
Reflection—exploration 
Reflection—Approval 
Exploration—Instigation 
Exploration—Labeling 
Labeling—Approval 


Ignoring—Topical transition 
Ignoring—Mislabeling 
Mislabeling—Topical transition 
Ignoring—Ex 
Ignoring—Labeling 
Ignoring—Reflection 


Exploration 


ploration 
} 


Topical transiti 


Object of hostility 
General—self 


General—therapist 
General—spouse 
General—child 
Therapist—self 
Therapist—spouse 
Therapist—child 
Spouse—child 
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The degree of interjudge agreement in cod- 
ing the response units was obtained from the 
20 interviews that the two judges rated in 
common. Comparison of the ratings revealed 
that in 261 of the units scored the judges 
were in perfect agreement. In 100 of the units 
they showed only minor discrepancies, most 
of which involved differences in their rating 
of the therapists’ responses (Table 1). 

The frequency with which specific types of 
scoring disagreements occurred are presented 
in Table 2. Many of the discrepancies in cod- 
ing the therapists’ responses involved cate- 
gories which overlap to some degree as, for 
example, reflection and labeling which are 
responses on an interpretative continuum. 
More marked discrepancies occurred, how- 
ever, in 37 units where one judge rated the 
therapists’ responses as avoidance whereas 
another judge rated them as some form of 
approach reaction. 

The reliability of the ratings of the thera- 


pists’ personality characteristics was esti- 
mated by the use of an analysis of variance 
technique developed by Ebel (1951). The re- 
sults are given in Table 3. 


Therapists’ Characteristics and the Use of 

Approach-Avoidance Responses 

Of the 4734 interaction sequences that were 
scored, 1619—-approximately 34% —were ones 
in which the patients expressed hostility. The 
therapists, in turn, responded with 938 ap- 
proach and 859 avoidance reactions. In or- 
der to test whether differences in therapists’ 
personality characteristics were related to the 
relative frequency of their approach of avoid- 
ance interventions, the therapists were di- 
chotomized into a high scoring and low scor- 
ing group on each of the personality scales. 
The measure of the therapists’ relative pref- 
erence for approach or avoidance reactions 
was based on the ratio of the number of ap- 
proach to the number of avoidance responses. 
The significance of the differences between 
the two groups was then tested by means of 
the Mann-Whitney U test (Siegel, 1956). 

Of the three hostility anxiety measures, 
only one—direct hostility-—yielded a statisti- 
cally significant difference. Therapists who 
expressed their hostility in direct forms were 
more likely to respond with approach reac- 
tions when the patients expressed hostility 
toward extratherapeutic objects than were 
therapists who were rated low on the direct 
hostility scale. This difference was significant 
at the .03 level. These two groups of thera- 
pists did not differ, however, in their handling 
of hostility when the therapist himself was 
the object of the patient's hostility. 


TABLE 3 


RELIABILITI 
THERAPI 


Scale 


Direct hostility 
Indirect hostility 
Hostility inhibition 
He Ip seeking 
Approval seeking 
Depe ndency inhibiti 
Sex inhibition 


Warmth 
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13 
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Psychotherapists’ 


TABLE 4 


APPROACH REAcrTi 
PATIENTS’ HosTILity 


PERCENTAGE OF 


Object of Hostility 


Pherapist rherapist 


Only one of the remaining personality meas- 
ures was related to the therapists’ handling 
of hostility. Therapists who displayed a high 
need for approval were more likely to avoid 
the patients’ hostility, whether directed to- 
ward the therapist (p < .06) or toward others 
(p < .002), than were therapists who were 
rated low on the approval seeking scale. 

In order to determine which of the specific 
approach and avoidance reactions may have 
contributed to the over-all differences that 
were obtained, separate statistical analyses 
were made for each of the therapist subcate- 
gories. The findings showed that therapists 
who readily expressed hostility in direct forms 
were less likely to ignore the patients’ hos- 
tility than were therapists who were rated low 
on the direct hostility scale (p < .03). In ad- 
dition, therapists high in approval seeking 
were less likely to make efforts to explore the 
patients’ hostility (p < .08), were more likely 
to ignore hostility (p < .004), and were more 
inclined discussion from hos- 
tility to nonhostility ( p 02) than were 
therapists who displayed low approval seeking 


to change the 


In general, therapists were less likely to ap 
proach the patients’ hostility when the thera 
pist was the object of hostility than when the 
patient directed his hostility toward others 
(Table 4). This difference, tested by means 
of the sign test for the difference between two 


Approach-Avoidance Reactions 


correlated sets of scores (Siegel, 1956), was 
significant at the .004 level. 


Influence of Therapists’ Approach and Avoid- 
ance Reactions on Patients’ Expression of 
Hostility 
The influence of the therapist’s behavior on 

the patient was measured in terms of the fre- 
quency with which the patient continued to 
express hostility following a particular thera- 
pist intervention. These results are presented 
in Table 5. Very few outright disapproval re- 
sponses were scored, consequently, this cate- 
yory was not included in the analysis. 

As can be seen from Table 5, if a therapist 
approached the patient's hostility, the patient 
almost certain to continue expressing 
such feelings. On the other hand, if the thera- 
pist responded with an avoidance 
the patient was 


was 


reaction, 
likely to drop the hostility 
topic. In testing for the significance of these 
differences, each specific approach category 
was compared with the total avoidance cate- 
gory. Similarly, each specific avoidance cate- 
gory was compared with total approach. The 
significance levels, yielded by the sign test, 
are given in Table 6 

Although the different approach reactions 
appeared to be equ lly effective in eliciting 
hostile feelings, some variability was noted 
among the avoidance reactions, with silence 
and mislabeling drawing hostility significantly 
more often than the remainder of the avoid- 
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TABLE 6 


SIGNIFICANCE OF DIFFERENCES BETWEEN THE 
APPROACH AND AVOIDANCE CATEGORIES 


Total 
Hostility 


Categories Compared 


Total approach _ vs. total avoidance 
Reflection vs. total avoidance 
Labeling vs. total avoidance 
Exploration vs. total avoidance 
Instigation vs. total avoidance 
Approval vs. total avoidance 


Ignoring vs. total approach 
Topical transition vs. total approach 
Silence vs. total approach 
Mislabeling vs. total approach 


Topical transition vs. remainder of avoidance 
Silence vs. remainder of avoidance 
Mislabeling vs. remainder of avoidance 


ance interventions. Inasmuch as silence is a 
rather ambiguous cue, the patient may at 
times interpret the therapist's behavior as in- 
dicative of interest and attentiveness rather 
than a sign of avoidance, disinterest, or dis- 
approval. In such instances, the therapist’s 
silence might be responded to by the patient 
with further expressions of hostility. As for 
mislabeling, such responses oftentimes led the 
patient to express his feelings even more di- 
rectly so as to correct the apparent failure in 
communication. 

Table 7 presents the findings on changes in 


TABLE 7 
PERCENTAGE OF TrmES PATIENTS CHANGED THE OBJECT 
oF Hostimity SpeciFic 
THERAPISTS’ RESPONSES 


of Ib jec t 
Therapist Response Changes 
Total approach 
Reflection 
Labeling 
Exploration 
Approval 
Total avoidance 
Ignoring 
Te ypical transition 
Silence 
Mislabeling 


07 
32% 


the object of hostility following the therapists’ 
responses. 

In contrast to approach reactions, the thera- 
pists’ avoidance reactions were far more likely 
to lead the patients to change the object to- 
ward whom they were expressing their hos- 
tility. These differences, tested by means of 
the sign test, were clearly significant for the 
over-all categories as well as for the subcate- 
gories (Table 8). 

Again, we find that silence and mislabeling 
had less of a disruptive effect on the patients’ 
expression of hostility than did the other 


rABLE 8 


SIGNIFICANCE LEVELS OF DIFFERENCES BETWEEN 


APPROACH AND AVOIDANCE CATEGORIES 


Categories Compared 


Total approach _ vs. total avoidance 
Reflection 
Labeling 
Exploration 
Approval 


vs. total avoidance 
vs. total avoidance 
avoidance 
avoidance 


Ignoring 

Topical transition 
Silence 
Mislabeling 


approac h 
| approacl 
approat } 


total approach 


Silence 
Mislabeling 


remainder of avoidance 
remainder of avoidance 


avoidance responses. This finding is in ac- 
cordance with that reported in the preceding 
section (Table 5). 


DISCUSSION 


The hypotheses concerning the influence of 
therapists’ hostility anxieties on their han- 
dling of hostility in the psychotherapeutic in- 
teraction were only partially confirmed. While 
therapists who, expressed hostility in direct 
forms tended to encourage their patients to 
express hostility, no significant differences 
were found between therapists who differed 
in the amount of indirect hostility or in the 
degree of hostility inhibition that they dis- 
played. The failure to find significant differ- 
ences for the latter measures may be due, in 
part, to the relative homogeneity of the thera- 
pists under study. Few, if any, of them manij- 
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fested severe hostility conflicts; and few, if 
any, were completely free of hostility anxieties. 

It was of interest to find that therapists 
who displayed a high need for approval 
tended to discourage the expression of hos- 
tility on the part of their patients. One might 
assume that a therapist who is strongly mo- 
tivated to elicit and maintain the approval of 
others would generalize to some degree the 
approval seeking to his patient as well. Thus, 
the therapist may refrain from encouraging 
the patient to express feelings that may be 
somewhat anxiety or guilt provoking, since 
to make the patient uncomfortable may lose 
him the patient’s approval. 

During the course of psychotherapy the pa- 
tient may generalize to the therapist some of 
his conflicts and emotional reactions. Once 
these responses are transferred, their modifi- 
cation will depend on the therapist’s response. 
Since the therapist has control over most of 
the conditions that may be essential for re- 
learning to occur, the patient’s behavior can 
be more readily modified in relationship to 
the therapist than in extratherapeutic inter- 
actions where interpersonal consequences are 
apt to be somewhat inconsistent or even con- 
tradictory. Our findings suggest, at least as 
far as the handling of hostility is concerned, 
that therapists are less inclined to accept and 
to analyze hostile feelings directed toward 
themselves than they are to accept and ana- 
lyze similar feelings expressed toward others 
It is difficult to know how much this finding 
may be a function of the relative inexperience 
of the therapists in the study, the majority 
of whom were still in training. 

Most previous studies that have investi- 
gated the influence of interviewers’ behavior 
on patients’ productions have focused on posi- 
tive reinforcing (Krasner, 1958; 
Rogers, 1958) except for Murray’s 
(1956) investigation, there is little data on 
the effect of therapists’ negative responses on 
patients’ verbalizations 


responses 


and, 


From the results of 
this study it is clear that avoidance reactions 
vccurred almost as frequently as did those of 
approach. The avoidance reactions, however, 
tended to take the form of nonrewards rather 
than of frankly negative reinforcements. Dis- 


approval was very rarely encountered, whereas 


same form of nonresponse was much more 
common. 

The prediction that approach responses 
would be more effective than avoidance re- 
sponses in getting the patient to express his 
hostility feelings was clearly confirmed. Not 
only were the patients apt to drop the dis- 
cussion of hostility if the therapist responded 
with avoidance, but also, in instances where 
they did continue with the topic, they were 
more likely to change the object of their hos- 
tility. In evaluating these results, it is impor- 
tant to bear in mind that the criterion of the 
influence of the therapists’ behavior was the 
immediately observable effect it had on the 
patients’ verbal behavior. Had some criterion 
of delayed outcome been used, the results 
may very well have been different. Moreover, 
the study was limited to the patients’ verbal 
behavior and the accompanying verbal reac- 
tions of the therapists. The operation of re- 
ward and punishment, on the other hand, is 
apt to take more subtle and nonverbal forms, 
particularly in the case of disapproval where 
there is little sanction for such reactions in 
the psychotherapeutic relationship. It is highly 
probable that many nonverbal cues, which in 
themselves could have produced the changes 
in the patients’ behavior, accompanied the 
therapists’ verbal responses. 

A number of the personality measures did 
not appear to be related to the therapists’ 
handling of hostility. They may, however, be 
related to the therapists’ handling of other 
forms of behavior; e.g., sexual and depend- 
ency expressions. A study of such relation- 
ships is now being planned. 


SUMMARY 


The present investigation was designed to 
test the following hypotiveses: (a) that thera- 
pists with high hostility anxiety are more 
inclined to avoid rather than to approach pa- 
tie ;’ hostility than are therapists who dis- 
play low anxiety; (6) that approach reac- 
tions encourage the patient to express further 
hostility, whereas avoidance reactions serve 
to decrease or inhibit such expression: and 
(c) that patients are more likely to change 
the object of their hostility following an avoid 
ance response by the therapist than they are 
following an approach response. 
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To test these hypotheses, tape recordings of 
110 interviews obtained from 17 patients 
treated by 12 therapists were analyzed. The 
therapists’ personality characteristics were as- 
sessed through ratings provided by clinical 
psychology stafi members. Patient-therapist 
interaction sequences were coded for the 
number of times therapists responded with 
approach or avoidance reactions to the pa- 
tients’ hostility and also for the frequency 
with which patients continued to express hos- 
tility immediately following therapists’ ap- 
proach or avoidance interventions. The ob- 
jects toward whom the patients directed their 
hostility were also rated. 

The results obtained were as follows: 

1. Therapists who typically expressed their 
own hostility in direct forms and who dis- 
played low need for approval were more likely 
to permit and encourage their patients’ hos- 
tility than were therapists who expressed 
little direct hostility and who showed high 
approval seeking behavior. No significant re- 
lationships were found for six other meas- 
ures of the therapists’ characteristics. 

2. In general, psychotherapists were more 
inclined to avoid hostility when it was di- 
rected toward themselves than when the pa- 
tients directed their hostility toward other 
objects. 

3. The patients were more likely to drop 
the hostility topic or to change the object of 


their hostility following therapists’ avoidance 
reactions than they were following the thera- 
pists’ approach reactions. 
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FORCE TRAINEES 
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During the past decade, research on sexual 

behavior has rapidly gained wider acceptance 
by the general public, and many commonly 
held attitudes are seriously challenged by new 
information. One major issue concerns the na- 
ture of homosexuality. Is homosexuality symp- 
tomatic of a more general personality disturb- 
ance or may it be regarded, in some instances 
at least, as confined to the sexual sector alone? 
In a recent article Hooker (1957) suggests, 
albeit very cautiously: 
Even if one assumes that homosexuality represents a 
severe form of maladjustment to spciety in the sexual 
sector of behavior, this does not necessarily mean 
the homosexual must bx maladjusted in 
other sectors of his behavior. Or if one assumes that 
homosexuality is a form of severe maladjustment 
internally, it may be that the disturbance is limited 
to the sexual sector alone (p. 30). 


severely 


A contrasting opinion is advanced by the 
Group for the Advancement of Psychiatry 


(1955, p. 2): “When such homosexual be- 
havior persists in an adult, it is then a symp- 
tom of severe emotional disorder.” It is with 
this conflicting opinion that the present pa- 
per is concerned. 

As part of a more extensive project, recent 
studies conducted at Lackland Air Force Base 
were analyzed to see which, if either, of the 
two views could be supported. From a mili- 
tary point of view the issue is one of practical 
importance in deciding upon proper disposi- 
tion of homosexual cases brought to light in 
the Air Force. 

The GAP report quite properly distin- 
guishes between homosexuality and homo 
sexual behavior. Homosexuality is considered 
to be a “persistent emotional and physical at- 


traction to members of the same sex,” and 
according to this report “constitutes an ab- 
normal personality development.” Adolescent 
exploratory experiences, occasional contacts 
while intoxicated, or isolated acts are more 
appropriately called homosexual behavior as 
distinguished from the condition of homosexu- 
ality. This distinction is especially important 
in the military service, where homosexual ac- 
tivity may result in trial by courtmartial and 
separation from the Air Force. Occasions also 
arise when it is necessary to identify those 
individuals who pretend to be homosexuals 
for the purpose of obtaining a discharge. 
There are a number of unique advantages 


‘to a study of male homosexuals in the Air 


Force. Nearly 100,000 airmen a year pass 
through basic training at Lackland Air Force 
Base under controlled conditions, providing a 
large sample of young adult men representing 
all characteristics of the general population. 
Most previous research on homosexuality has 
been severely limited by strong biases in the 
population studied. The almost complete lack 
of privacy and the barracks-type of living are 
likely to stimulate sexual drives in male ho- 
mosexuals. Prevailing cultural attitudes and 
stringent military policy heighten the con- 
flict, precipitating contact with the psychi- 
atric clinic. 

In the Air Force all homosexual suspects 
are subjected to exhaustive interrogations by 
special investigators. Since heterosexual sub 
jects (Ss) are not usually put through this 
procedure, it was deemed advisable in formu 
lating an experimental design to include a 
heterosexual group which had been through 
such an investigation. In the 
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Table 1 


Categories of Experimental Groups According to Homosexuality-Heterosexuality 
Dimension and Reason for Special Investigation 


Group 


H Homosexual 
(predominant 

A Homosexual 
(accessory 

D Heterosexual 
disciplinary) 

N Heterosexual 
(normal) 


tion two homosexual and two _ heterosexual 
groups, each containing 20 individuals, were 
selected for thorough case study and psycho- 
logical testing. 


Description of the Study 


All homosexual suspects referred to the psy- 
chiatric clinic at Lackland Air Force Base 
during a period of several months were given 
an experimental battery of psychological 
tests, a searching psychiatric interview, and 
a thorough case history. The psychiatrist ' 
conducting the interview rated each individual 
on a six-point heterosexual-homosexual scale 
adapted from Kinsey’s classification (Kinsey, 
Pomeroy, & Martin, 1948). The scale ranged 
from exclusively heterosexual orientation, 
through various mixtures of heterosexuality 
and homosexuality, to exclusively homosexual 
behavior. 

Twenty airmen in each of four groups were 
selected for the present study. The 20 men 
judged to be exclusively homosexual com- 
prised the H group. The A group consisted 
of men who were predominantly heterosexual 
but had experienced varying degrees of homo- 
sexual contact. Since individuals of both ho- 
mosexual groups had been subjected to spe- 
cial investigation as suspected homosexuals, 
a control group (D) was formed, consisting 
of heterosexuals who had been through a 
similar investigation for other kinds of alleged 


1 All psychiatric examinations were accomplished 
by Louis J. West, Department of Psychiatry and 
Neurology, University of Oklahoma School of Medi- 
cine, Oklahoma City, Oklahoma. 


Psychosexual Orientation 


Predominantly homosexual 


Predominantly heterosext 


Reason for 


Investigation 


Homosexual behavior 


ial Homosexual behavior 


with homosexual experiet 


Exclusively heterosexual! 


Exclusively heterosexual 


offenses. A fourth group (N) of normal het- 
erosexual men constituted a second control 
group for comparison with the two classes of 
homosexuals. 

All 80 men were given a battery of 10 tests 
in the following order: Sexual Identification 
Survey, Homosexual Homonyms, Edwards 
Personal Preference Schedule (Edwards 
1955), Heineman’s Forced-Choice Anxiety 
Scale (Heineman, 1953), Worchel’s Self-Ac- 
tivities Inventory (Worchel, 1957), the Food 
Preference and Aversion Scale, the Ror- 
schach test (including a special testing of 
limits for sex), the Blacky Pictures Tech- 
nique (Blum, 1950), the MMPI, and six sub- 
tests of the Wechsler Adult Intelligence Scale 
(Vocabulary, Information, Similarities, Pic- 
ture Completion, Block Design, and Digit 
Symbol). The Sexual Identification Survey 
was designed especially for this study and 
consisted of 30 human figures taken from 
drawings, paintings, statues, and other sources 
which were photographically reproduced in 
such a way as to make the sex ambiguous. 
The S’s task was to identify the sex of each 
figure. The Food Preference and Aversion 
Scale was adapted from Wallen (1943) and 
consisted of 29 foods to be rated by the S$ on 
a five-point like dislike scale. 

The third test developed especially for this 
study was the Homosexual Homonyms. Using 
Secord’s (1953) method, a list of 10 homo- 
nyms (blow, gay, fairy, French, fruit, queer 
rear, suck, pansy, Greek) was embedded in a 
buffer list of 50 words and given to each S 
as a word association test. 
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Each test was scored without awareness of 
the sexual class into which the S had been 
placed in order to avoid any possible bias 
Where score distributions were reasonably 
normal, comparisons among the four experi- 
mental groups were made by analysis of 
variance. Where distributions were severely 
skewed or truncated, chi square or the 
Kruskal-Wallis nonparametric test (Walker 
& Lev, 1953) was used. 


Results 


Striking differences were obtained between 
the H group and the other three groups on 
most of the MMPI scales. F ratios significant 
beyond the .01 level were obtained for F, Hy, 
Pd, Mj, Pa, Pt, and Sc; and beyond the .05 
level for Hs, D, and Si. Mean T scores for 
each scale and each group are presented in 
Fig. 1. 

The sharp difference in elevation of most 
scores in the H group, as compared to the 
other three groups, strongly suggests that the 
symptom of homosexuality is but a part of 
a more general personality disturbance. The 
close similarity of the A group to the two 
control groups is convincing evidence of the 


r-SCORE 


important distinction between the exclusively 
homosexual individual and the person who 
has only occasionally engaged in homosexual 
activities. Of particular interest is the com- 
parison of the four groups on the masculinity 

femininity scale. Only two men in the H 
group had 7 scores on the Mf scale less than 
70 (and one of these obtained a score of 67)! 
By contrast, the highest 7 score obtained in 
the A group was 70—and there were only 4 
such individuals out of 20. None of the nor- 
mal controls (N) and only one of the delin- 
quent controls (D) had an Mf score as high 
as 70. 

Similar results were obtained for Heine- 
man’s Forced-Choice Anxiety Scale. Adapted 
from Taylor’s Manifest Anxiety Scale (Tay- 
lor, 1953) and the MMPI, Heineman’s scale 
was designed to eliminate social desirability 
as a response set in anxiety questionnaires. 
All of the statistical significance in the F 
ratio between groups can be attributed to the 
unusually high mean score for the H group. 
The H, A, D, and N groups obtained mean 
scores of 46.6, 35.4, 30.5, and 31.5 respec- 
tively. Again, it is apparent that the exclu- 
sively homosexual male manifests a general 


K Hs D. Hy 


Ls 


Pd Mf Pa t Ss Ma Si 


Mean 7 scores of the MMPI for four groups of airmen (predominantly homosexual, accessory ho- 
mosexual, disciplinary, and normal). N = 80; each group = 20. 
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personality disturbance while the occasional 
homosexual does not. 

Assuming that sexual associations to the 
critical words of the Homosexual Homonyms 
are anxiety arousing, delayed response times 
may reflect this anxiety. The mean trans- 
formed scores for response times yield identi- 
cal rankings among the groups with the scores 
obtained from Heineman’s Forced-Choice Anx- 
iety Scale, and the magnitudes of the differ- 
ences are almost identical, suggesting that the 
similarity is due to the effects of anxiety. 

Food aversions have been positively corre- 
lated with anxiety and neuroticism (Smith, 
Powell, & Ross, 1955; Wallen, 1945). Com- 
bining the scores of the two homosexual 
groups and the two heterosexual groups on 
the Food Preference and Aversion Scale pro- 
vided adequate frequencies for contingency 
tables. The total list and 5 of the 29 foods 
yielded aversion scores for the combined ho- 
mosexual groups significantly different from 
the combined heterosexual groups beyond the 
.05 level. 

Based upon the assumption that sexual 
confusion may exist in a person of one sex 
who has the interests, attitudes, and motiva- 
tions of the other, the Sexual Identification 
Survey was administered to all Ss. While re- 
sponse times and deviation scores were gen- 
erally consistent with a priori expectations, 
none of the differences were statistically sig- 
nificant. These negative findings suggest that 
homosexuals, although possibly confused in 
their masculine-feminine roles, may have 
learned to compensate for their confusion. In 
order to seek out others who are also homo- 
sexual, they may have become perceptually 
attuned to recognize subtle cues of another’s 
homosexuality. This training may compen- 
sate for any role confusion in themselves or 
in their identification with others. In any 
event, these results suggest that popular in- 
terpretations of masculine-feminine role con- 
fusion in projective techniques may be in- 
valid. 

Perhaps the most important implication to 
emerge from this study concerns the severity 
of psychopathology which is likely to accom- 
pany the markedly homosexual individual. 
For this person, the findings support the con- 
tention that he is likely to be suffering from 


an emotional disorder which is relatively per- 
vasive and severe. By comparison, the acces- 
sory homosexual does not show evidence of 
extensive personality disturbance of the se- 
verity found to characterize the markedly 
homosexual individual. 

Another implication is the need for devel- 
oping assessment instruments which will in- 
crease the accuracy and the objectivity of the 
evaluation of a homosexual condition with 
respect to both its presence and its severity 
Such information would have diagnostic, 
prognostic, and treatment values. Some of 
the tests used in this study show develop- 
mental potentialities for such purposes. 

Further research similar in kind and scope 
to the present study should be undertaken for 
the female sex. The same kinds of data for 
homosexual women could be integrated with 
data from this study to provide more com- 
plete information on homosexuality in gen- 
eral. Complementary findings from both stud- 
ies might contribute to a better understand- 
ing of psychosexual development, over-all 
personality functioning, and the relationship 
of homosexuality to psychopathology. 


Summary 


A battery of 10 psychological tests, includ- 
ing 2 original tests, was administered to 80 
enlisted airmen divided equally into four 
groups. Ss were assigned to groups according 
to whether they were heterosexual, partly 
homosexual, or markedly homosexual in their 
psychosexual orientations. Only the markedly 
homosexual group gave test records that were 
strikingly different from the control groups, 
suggesting that markedly homosexual indi- 
viduals are likely to be suffering from an 
emotional disorder which is relatively per- 
vasive, severe, and disqualifying for military 
service. The partly homosexual group (com- 
posed of individuals who were predominantly 
heterosexual but with varying degrees of ho- 
mosexual experience) gave test records that 
closely approximated the results of the two 
control groups. Implications for improving 
existing methods of assessment and disposi- 
tion of homosexual cases are discussed, and 
extension of the present study to include fe- 
male members of the Air Force is recom- 
mended. 
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Several studies have consistently reported 
high frequencies in the MMPI of D and Pt 
(the so-called 27 or 72 codes) scores among 
patients suffering severe emotional disturb- 
ances, in particular depression (Welsh & 
Dahlstrom, 1956). One study (Simon & 
Hales, 1949) found a high incidence of this 
pattern in hospitalized suicidal veterans. A 
different study demonstrated a significant re- 
lationship between D and Pt and low self- 
acceptance (Zuckerman & Monashkin, 1957). 
It would seem that these two scales alone 
would be of specific value in outpatient clinics 
or other situations where brief patient con- 
tacts might preclude the use of the full 
MMPI. Arguments against using single scales 
of the MMPI as tests in themselves rest upon 
several grounds. For one thing, some studies 
indicate that single scales of the MMPI tend 
to show little validity when tested against 
standard diagnostic categories (Welsh & 
Dahlstrom, 1956). Also, there is the possi- 
bility that there may be different response 
tendencies if the scale is isolated from its 
context of being imbedded among a large 
host of other scales. While these arguments 
are compelling, it is conceivable that the dis- 
criminability of the various MMPI scales 
differs from one scale to another and would 
vary according to the kind of behavior one 
would try to predict. Whether an isolated 
scale or two used as a complete test produces 
less discrimination than the same scale in- 
cluded among a group of others is an em- 
pirical matter yet to be convincingly demon- 
strated. The evidence on hand is consistent 
in that elevations in both D and Pt are fre- 
quently found in the MMPI profiles of pa- 
tients having symptoms of depression and 
loss of self-confidence independent of specific 
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diagnoses. Thus if one were interested in the 
identification of individuals who are seriously 
depressed without regard to their diagnostic 
status, these two keys used alone in a single 
scale might be expected to work well. 

The purpose of this study was to test the 
discrimination power of a brief form of the 
MMPI on population samples differing in de- 
grees of depression using only those items 
necessary to score D and Pt. 

The experimental scale, to be referred to as 
ML (for Morale Loss), consisted of the 116 
items of the MMPI that are scored for D, 
Pt, and K. The K items were included since 
they enter into a corrected T score for Pt and 
also its role as a “suppressor variable’ has 
possible clinical as well as research value. The 
ML scale was made up in a booklet form 
allowing it to be used either with an answer 
sheet or by having the subject (S) record his 
answers directly in the booklet. The average 
time for individual administration 
tween 15 and 20 minutes. 

Three groups of Ss were tested by the ML 
scale. Group A consisted of 18 patients (nine 
female) who had been hospitalized because 
of a neuropsychiatric disorder and who had 
recently (within a period of 15 dzys prior to 
testing) made an unsuccessful active suicide 
attempt. In every instance some action was 


was be- 


‘taken which was judged to represent a bona 


fide attempt at self-injury or destruction and 
not mere verbal threats.. The Group A pa- 
tients were all given a psychiatric diagnosis 
during the hospitalization following the sui- 
cide attempt: five patients were considered to 


1 These included: gunshot wounds, slashing of 
face and neck, wrist slashing, ingestion of known 
poisons, overdose of sedatives, inhalation of gas, and 
attempt at hanging self. 
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Table 1 
Mean Scores* on the ML Form of the MMPI 


Group D Pt Pt+K &K 
A patients Mean 80.94 72.17 79.89 11.11 
(V=18) SD 14.6 15.4 15.9 34 
B patients Mean 68.33 56.56 65.67 15.94 
(N =18) SD 14.9 10.8 117 5.2 
C normals Mean 48.72 40.83 4744 17.94 
N=18) SD 7.9 44 5.2 6.9 
* Entries for K are raw scores. All others are T scores 


be schizophrenic; three, manic-depressive de- 
pressed; and the remainder severe psycho- 
neurotic reactions with emphasis placed on 
the depressive symptoms. 

Group B contained 18 patients (nine fe- 
male) who were either hospitalized for a psy- 
chiatric disorder or seen in the outpatient de- 
partment of Henry Phipps Psychiatric Clinic 
for diagnostic study and treatment. None had 
a history of overt suicide attempts nor were 
they considered suicidal at the time of study. 
The diagnoses included: four schizophrenic 
reactions, four depressive reactions (not psy- 
chotic), and the remainder psychoneurotic 
reactions with anxiety and some depressive 
features. Thus the A and B patients differed 
essentially in the fact that the former had at- 
tempted suicide and were considered more 
severely depressed. ‘ 

Group C consisted of 18 presumably nor- 
mal adults (nine female) who had no history 
of mental illness requiring psychiatric treat- 
ment and were comparable to the A and B 
patients with respect to age and socioeco- 
nomic status. The C normals were obtained 
from groups of persons working in diverse oc- 
cupations in and around a hospital setting, 
such as machinists, plumbers, clerks, labora- 
tory technicians, etc. The mean age for Group 
A was 32.2; for Group B, 31.4; for Group C, 
33.3. All ML scales were administered in in- 
dividual sessions with the Ss. The directions 
for the test were essentially the same as those 
used in the standard MMPI booklet. 

The D, Pt, and K raw scores were deter- 
mined from the appropriate MMPI keys and 
the T score equivalents for the two clinical 
scales were determined from the MMPI 
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tables. Both the K-corrected and uncorrected 
T scores for Pt were recorded. The JT scores 
of the clinical scales were evaluated in prefer- 
ence to the raw scores since half the Ss were 
female, and the 7 score norms for D and Pt 
have different values for the sexes. 


Results and Discussion 


The mean scores for the three groups on 
the D, Pt, and K scales may be seen in 
Table 1. The significance of differences among 
the mean scores of the two patient groups and 
the normal adults was evaluated by simple 
analysis of variance. Differences between ad- 
jacent group means were then analyzed by 
the ¢ test, using the within groups variances 
obtained from the analysis of variance to de- 
termine the best estimate of the standard 
error of the difference. Table 2 provides a 
summary~of the results of the analysis. It 
may be seen that both patient groups were 
significantly higher on D and either Pt vari- 
able, with Group A scoring significantly 
higher than Group B, considering the patient 
groups. Only in the case of K was there no 
significant difference found between Groups 
B and 

It is apparent that the short MMPI scale 
as used in this study does permit discrimina- 
tion between the two types of patient groups 
which differed originally in the degree of de- 
pression, as well as between the patient and 
normal adult groups. The results also tend 
to support the findings of Simon and Hales 
(1949) on their suicidal group in that a 
higher proportion of the A patients scored 


Table 2 


Summary of Tests of Differences Among Patient and 


Normal Groups on D, Pt, and Pt + K T Scores 
Values of 
F for Dif- 
ferences Values of ¢ 
amongall 
Three Group Group 
Variable Groups Avs.B Bvs.C 
D 26.88** 2.85* 4.43* 
PI 33.55** 4.09** 4.12** 
Pi+kK 39.59** 3.89** 4.98** 
AK (raw scores) 11.78* 343° 1.39 


*p <.01. 
<.001. 
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Table 3 


Distribution of the ML Key Scores of Patient and 
Normal Groups 


Group A Group B 


Range of Scores (N = 18) (N = 18) 


Group C 
(N = 18) 


29-30 
27-28 
25-26 
23-24 
21-22 
19-20 
17-18 
15-16 
13-14 
11-12 
9-10 
7-8 
5-6 
3-4 
1-2 
0 


above T score 70 on both D and Pt than did 
the B patients. Seventy-two per cent of the A 
patients scored this high on both scales as 
compared to 22% of the B patients. The find- 
ings with respect to K were consistent with 
those of other clinical studies in that the 
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value of K drops as the degree of disturbance 
increases. The Pt scale differentiated the 
groups as well as Pt + K, although the latter 
yielded higher 7 scores for each of the three 
groups. 

The fact that Pt shows high positive rela- 
tionships with D consistently and appears to 
be sensitive to emotional disturbances even 
where no dominant obsessive-compulsive or 
phobic syndromes are apparent, raises the 
point that P# measures emotional disturbance 
per se better than “psychasthenia.” Support 
for this view was provided when the data of 
the present study were analyzed for the most 
differentiating of the ML test items that 
would help separate Group A patients from B 
patients and both from normals. A total of 30 
items seemed to approximate an arbitrary cri- 
terion whereby a response in one direction to 
an item would be given by two thirds of 
Group A to one third of Group B and less 
than this by Group C.? Table 3 summarizes 


2 These items with their MMPI booklet numbers 
and the direction scored for ML are as follows: 5T, 
8F, 32T, 39T, 41T, 67T, 76T, 86T, 84T, 106T, 107F, 
145T, 153F, 182T, 189T, 236T, 238T, 290T, 301T, 
337T, 340T, 343T, 352T, 357T, 358T, 359T, 361T, 
362T, 366T, 383T. 


Table 4 


Distributions and Means of ML Key Scores of Bri 


I II 


Chronic 

Brucellosis 
Range of Group 
ML Scores (N = 9) 


20+ 

18-19 
16-17 
14-15 
12-13 
10-11 


Mean ML 
Score 


Nonchronic 
Brucellosis 
Group 
(N = 8) 


Ic€losis an 


iICMI Groups 


IV 
Low-Negative 
CMI 
Group 
(N = 26) 


Ill 
High-Positiy 
CMI 
Group 


(N = 15 
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I vs. II III vs. IV / 
= 3.41 = 5.85 


the distributions of the scores obtained by 
such a key from the three groups. When the 
items were identified as to which of the three 
MMPI keys were involved, it was found that 
22 of the 30 items are scored for Pt, 7 are 
from D (with 2 scored oppositely), and 1 
from K (also scored oppositely). 

The likely meaning applied to Pt or the 
key derived as above was explored further by 
the results obtained from other studies car- 
ried out by the writer. In one study (Im- 
boden, Canter, Cluff, & Trever, 1959) both 
D and Pt were found to differentiate between 
two groups of adults who had suffered acute 
brucellosis infections. One group had recov- 
ered completely from the illness while the 
other was chronically ill for at least several 
years and up to the time of the study. While 
the chronic group scored higher on D and Pt, 
the psychiatric studies did not suggest promi- 
nence of obsessive-compulsive or phobic symp- 
toms. However, the psychiatric studies and 
a self-concept scale indicated they were de- 
pressed and suffering low self-esteem. The 
records of both groups were scored by the 
tentative key, labeled ML, made up of the 
30 items referred to above. 

In a separate study still in progress, Ss who 
had taken the Cornell Medical Health Ques- 
tionnaire (Brodman, Erdman, & Wolff, 1956) 
were divided into two groups and compared 
on the ML key. One group had extremely 
high CMI scores indicative of positive medi- 
cal illness histories, while the other group 
represented a sample having extremely low 
CMI scores. Table 4 gives the results of com- 
paring the various groups in these two studies 
on the ML key. As can be seen, the ML vari- 
able separates the groups as might be ex- 
pected if one assumes that chronically ill or 
frequently ill people are more likely to be low 
in self-esteem or morale than healthier people. 

From such findings, the high proportion of 
Pt items in the ML key, and other studies in 
the literature with respect to Pt (Welsh & 
Dahlstrom, 1956), it would appear that the 
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Pt scale might be profitably reanalyzed with 
respect to its self-esteem components and 
those more uniquely associated with psy- 
chasthenia. The ML key is offered as only 
one tentative approach to this problem. 


Summary 


A short form of the MMPI made up of the 
D, Pt, and K items only was administered to 
two patient groups, which differed in their 
degree of depression and emotional disturb- 
ance as judged by their histories of suicidal 
attempts, and to a group of normal adults. 
The scores of the three groups on the three 
keys revealed that they could be differentiated 
from each other according to the degree of 
disturbance. Thus the effectiveness these keys 
have as ordinarily used in the entire MMPI 
scale was maintained when isolated in this 
fashion. The Pt scale was given further atten- 
tion when a key made up of the most dis- 
criminating items of the short test was found 
to be heavily weighted with Pt. The results 
of follow-up studies and the long observed 
relationship between Pt and D suggest that 
Pt should be redesignated as a morale loss or 
self-esteem scale. An alternate key to Pt was 
offered by the study toward this end. 
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THE EXPRESSION OF HOSTILITY AND GUILT IN 
MELANCHOLIC AND PARANOID WOMEN 
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That feelings of hostility and guilt, with the 
concomitant expression of extrapunitive and 
intropunitive attitudes, are important factors 
in diagnosis and psychopathology has been 
recognised for many years. The Freudian 
doctrine of superego formation, for example, 
involves aggressive and guilt components. In 
discussing symptomatology in general psychi- 
atry, Henderson and Gillespie (1950) con- 
sider that certain delusions (the idea that 
one’s thoughts are being read, and self-accu- 
satory ideas of delusional intensity) may rest 
on an affective basis of guilt. In clinical psy- 
chology, Foulds (1958) has adapted a tech- 
nique, devised by Rosenzweig, Clarke, Gar- 
field, and Lehndorff (1946), which elicits the 
expression of intropunitive and extrapunitive 
attitudes for diagnostic purposes. Fundamen- 
tally, Foulds’ adaptation measures the de- 
gree to which one is more critical or less criti- 
cal of oneself than of others. Siegel (1956) 
has developed a “hostility” scale from the 
Minnesota Multiphasic Personality Inventory 
(MMPI) which he has related to ‘“authori- 
tarianism.” He found that groups scoring 
highly on his dimension of authoritarianism 
had greater overt hostility as measured by his 
hostility scale. In a further paper (Siegel, 
Spelka, & Miller, 1957) presented at the 
American Psychological Association Conven- 
tion in September 1957, evidence was pro- 
vided of the feasibility of measuring varying 
aspects of hostility such as “extrapunitive,” 
“intropunitive,” and “projected.” Alexander 
(1949) has distinguished between provocative 
hostile behaviour, projected hostility, guilt 

1The author would like to express his indebted- 
ness to G. A. Foulds for his encouragement and ad- 
vice during the course of this investigation, and his 
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feelings, and inferiority feelings. It seems 
clear then, that Hostility and guilt are global 
terms and that different authors have sepa- 
rated out certain aspects for study. 

In line with this tradition, the present study 
is concerned with the measurement of some 
aspects of hostility and of self-criticism/ guilt 
at various levels of expression. Since interest 
was centered on measurement, persons known 
to be extreme to a pathological degree in the 
expression of these feelings, namely melan- 
cholics (guilty) and paranoid states (hostile) 
were used as subjects (Ss). 


Method 


Measurement of Extrapunitive and Intro- 
punitive Attitudes 


The purpose, in testing, has been to work 
from an obvious, overt statement of attitude, 
to an indirect, covert expression. 

At the superficial level, two hostility scales 
were made up from the MMPI, one consist- 
ing of 21 acting out hostile statements such 
as: “at times I have a strong urge to do some- 
thing harmful or shocking,” “I can easily 
make other people afraid of me, and some- 
times do for the fun of it,” and “horses that 
don't pull should be beaten or kicked.” A 
second scale, implying criticism of others and 
accusing others of hostility to oneself, was 
devised as a measure of projected hostility. 
This scale consisted of 26 items such as, “I 
think most people would lie to get ahead,” 
and “I believe I am being plotted against.” 

As a measure of self-criticism/guilt at this 
level, items implying criticism of self or of ad- 
mission of guilt were drawn from the MMPI. 
This scale included such statements as: “I 
have one or more habits which are so strong 
that it is no use fighting against them,” “I 
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have several times given up doing things be- 
cause I thought too little of my ability,” and 
“T believe my sins are unpardonable.” The 
items of this scale were allocated by the au- 
thor and agreed upon by the senior psycholo- 
gist at this department. 

At a less direct level, a development of a 
method of measurement described by Watson, 
Pritzker, and Madison (1955) was employed. 
This method consisted of 60 sets of scrambled 
words which could be assembled in either of 
two ways by leaving out one word in each set. 
One completion led to a hostile sentence and 
the other to a neutral. These authors found 
that a neurotic group made significantly more 
hostile completions than did a nonneurotic 
control group. In the test constructed for the 
present study, 48 sets of four scrambled words 
were compiled. Paralleling the MMPI scales, 
in 16 of the sets one completion led to an 
acting out hostile response and the other to 
a neutral; in 16, one completion led to a pro- 
jected hostile response (as defined above for 
the MMPI scale) and the other to a neutral, 
and in 16, one completion led to a self-criti- 
cal/guilt sentence and the other to a neutral 
one. Examples: 


acting out hostility 
HIT You 
I'LL TAKE 


projected hostility 
THEY'D ME 
CHEAT FIND 

self-critical/guilt 
COURAGE NO 
BOOKS I'VE 


The order of the words was randomized in 
making up the sets. The number of possible 
combinations was so great that the effect of 
a methodical approach by the S, such as al- 
ways reading along the top line from left to 
right, was reduced to a minimum. 

In administering this Sentence Building 
test, the card with the four scrambled words 
on it was laid in front of the S, who was in 
structed to give the first sentence he could 
make up as fast as he could, by leaving one 
word out. The emphasis was on reaction time, 
with a stop watch prominently displayed and 
each time noted down. The number of acting 
out hostility constructions, projected hostility, 
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and guilt construction given first was noted 
for each S. This may be regarded as a projec- 
tive type of test in which perceptual factors 
such as the “selection of personally relevant 
cues,” to use Bruner’s (1951) phrase, may 
be operative. The word or phrase frequency 
effect, the problem first raised by Solomon 
and Howes (1951), should be the same for 
both groups. 

At a more indirect level still, eight The- 
matic Apperception Test cards (TAT), 
namely 4, 3, 13MF, and 18GF of the stand- 
ard set, and four specially prepared pictures 
were used. These cards were selected be- 
cause they were thought to suggest aggressive 
themes. Administration was that described by 
Foulds (1953), involving the four questions, 
“what is the situation,” “what led up to it,” 
“how are they thinking and feeling,” and 
“how does it finish up?” The protocols were 
then scored for hostility and guilt content by 
two psychologists independently and without 
knowledge of the Ss’ identity. 

A detailed scoring system was devised for 
this purpose. For the hostility scoring, the 
mention of murder or death through accident 
or war was given a score of 3. Stories of physi- 
cal assault, injuries dangerous to life, or rape 
were given scores of 2. Minor injuries, verbal 
assault, or the mention of interpersonal fric- 
tions were given scores of 1. For the quilt 
scoring, the mention of suicide scored 3, re- 
morse with reparation 2, and remorse with- 
out reparation 1. If several of these themes 
were mentioned in one picture the highest 
score was taken. These highest scores were 
then summed for the series of eight pictures 
to get a total score. Since a correlation of .98 
for ratings of hostility and .95 for guilt were 
obtained, the ratings of the senior psycholo- 
gist were accepted as working scores. 

These unusually high correlations may be 
explained by the avoidance of interpretation. 
The raters had, in the main, merely to check 
whether certain statements had or had not 
been made. Some difficulty arose, however, in 
deciding whether a particular word or expres- 
sion could be treated as synonymous with re- 
morse. 

At the most indirect and ambiguous level, a 
tapping psychomotor test was administered. In 
a study by Foulds, McClelland, McClelland, 
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and Creasy,? the Tapping test was found to test. Since, in these studies, hysterics and 
differentiate significantly between paranoid paranoids scored relatively high on tapping 
and nonparanoid schizophrenic groups. In a and on various measures of hostility, the 
recent investigation by Foulds and Caine Tapping test was introduced here to investi- 
(1958), female dysthymic patients were suc- gate further its possible relationship to hos- 
cessfully distinguished from hysterics on this tility. In the administration here employed 

2G. A. Foulds, W. J. McClelland, Marilyn Mc- ‘he 5 was merely instructed to tap on a stand- 
Clelland, and Monica Creasy. Personal communica- ard sized piece of plain white paper, with a 
tion. 1958. pencil, as fast as he could for 10 seconds. 


Table 1 


Mean Score Comparisons of Melancholics and Paranoid States on Age, Vocabulary, MMPI Scales, 
Sentence Building Test, and TAT Hostility and Guilt Ratings 


Item N Mean SD t p 


Age 
Melancholics 17 51.12 6.48 
Paranoids 50.78 8.92 


Mill Hill Vocabulary 
Melancholics 17 26 
Paranoids 2 


Sentence Building test 
Acting out hostility 
Melancholics 17 7.12 1.62 
Paranoids 7 
Projected hostility 
Melancholics 17 5.35 
Paranoids 21 
Guilt 
Melancholics 
Paranoids 
MMPI hostility 
Acting out hostility 
Melancholics 17 4.88 2.47 
Paranoids 5 


10.00 
6.79 
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Projected hostility 
Melancholics 17 4.47 3.53 
Paranoids 10.07 
MMPI self-criticism and guilt 
Melancholics 17 1 
Paranoids 
TAT 
Hostility 

Melancholics 12 3.50 2.90 

Paranoids 


Guilt 


Melancholics 12 
Paranoids 


MMPI depression 


Melancholi 1] $4.65 12.74 
Paranoids 14 53.43 7.18 8.392 <.001 


ai 
36 4.87 333 >1 
O88 > 1 
2.323 < O05 
3.733 <.001 
891 >.1 
3.182 <.01 
$.807 < 
2.150 <.05 
| 
1.57 1.88 079 >.1 7 
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This was done three times. The spread of the 
resulting pencil dots was then determined by 
counting the number of half inch template 
squares entered for each trial, and the aver- 
age for the three taken. 

Since the present battery of tests was mainly 
of a verbal nature a minimum of Grade IV 
(i.e., above the 10th centile) on the Mill Hill 
Vocabulary Scale (Raven, 1954) was set. As 
a diagnostic check the depression scale of the 
MMPI was included. 


Subjects 


Three chronic hospitalized paranoids and 
all paranoid and melancholic female patients 
fulfilling the verbal level criterion who en- 
tered Runwell Mental Hospital during the 
period September 1957 to February 1958 
were considered for testing. Of those entering 
the hospital, three paranoids could not be in- 
cluded since one refused to cooperate, one 
was deaf, and one had been leucotomized. 
Two of the melancholics had to be rejected 
as untestable. Care was taken not to include 
any “mixed” cases of paranoids with depres- 
sive features or melancholics with paranoid 
features, since these are to form the basis of 
a separate investigation. In all, 14 paranoids 
and 17 melancholics were tested, but only 12 
of the latter took the TAT. 


Results 


It wag considered that the distributions of 
scores on age and tests, with the exception 
of tapping, were sufficiently normal to justify 
the use of a ¢ test comparison and the results 
are set out in Table 1. For the Tapping test 
the nonparametric Mann-Whitney U Test 
(for Ne between 9 and 20) was used. 

It will be seen from Table 1 that there 
were no siguificant differences between the 
groups in age or vocabulary level. 

Of the specially devised MMPI scales, the 
acting out hostility scale failed to differentiate 
between the groups. On the other hand, the 
paranoid group had significantly higher scores 
on the projection of hostility scale and sig 
nificantly lower scores on the self-criticism 
guilt scale. Supporting this finding, at the 
more covert level of response of the Sentence 
Building test, the acting out hostility sen- 
tences failed to differentiate, but both the 


Paranoid Women 21 
projected hostility and guilt sets have done 
so in the same direction as the MMPI scales. 
At the most covert level, the paranoid TATs 
were given higher hostility ratings than were 
the melancholics. At this level the guilt meas- 
ure failed to differentiate. The Tapping test 
was, surprisingly, the most successful diag- 
nostic test of the battery. The median score 
of the melancholics on this test was 2, that 
of the paranoids was 21. The score rankings 
gave a U value of 22, yielding a p of <.002. 


Discussion 


This study supports the contentions of 
Siegel that the varying directions of hostility 
(if guilt be regarded as introjected hostility) 
can be objectively measured. Siegel (1957) 
suggests that individuals may vary more in 
the manner and direction in which they ex- 
press hostility than in the amount of hostility 
they possess, and it may well be that the ap- 
proach used in the present study could be 
used to clarify this point. The value of dis- 
tinguishing various aspects of hostility and 
guilt has been demonstrated and further logi- 
cal divisions are indicated if the study were 
to be extended to other groups. Two further 
divisions would be to separate a general “criti- 
cism of others” from complaints of the direct 
receipt of hostile acts on the part of others, 
and a separation of self-criticism (Alexander’s 
inferiority feelings?) from admission of guilt 
and wickedness. 

The use of different levels of expression 
might well be a measure of the amount of 
hostility possessed. Normal, neurotic, and psy- 
chopathic g oups would be expected to differ 
from psychotics and between themselves, not 
only in the aspects of hostility and guilt ex- 
pressed, but also in the level of expression. 
One suspects that the consistency at all levels 
of response might not hold for neurotics and 
normals, and that this may well indicate the 
profundity and pervasiveness of the feelings 
the individual. Before this can be at- 
tempted, however, improvements in the pres- 
ent battery of tests would be required, par- 
ticularly at the covert levels. It was not 
possible in the TAT thus administered, for 
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3 This has now been done for a number of neu- 
rotic and psychotic groups. A paper is being pre- 
pared for publication. 
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example, to isolate different aspects of hos- 
tility satisfactorily, and since an expression 
of guilt tended to accompany acts of hostility 
in the stories, this measure of guilt failed to 
differentiate between the groups. This may 
well be more culturally than emotionally de- 
termined. It is of considerable interest that 
although the paranoids avoided acting out 
hostility on the MMPI scales and Sentence 
Building test to the same degree as the melan- 
cholics, they gave free rein to their feelings 
on the TAT, in which stories of murder, rape, 
violence, and quarrelling were marked. 

With the Sentence Building test there was 
considerable loss of distance. Patients often 
related their response directly to themselves, 
denying or confirming the truth of a state- 
ment made out of the words. That many of 
the patients were aware of the possibility of 
alternative responses was obvious from the 
remarks to this effect, from such comments 
as, “Oh, I couldn’t say that,” or exclamations 
followed by a neutral sentence. It can be ar- 
gued that the paranoid patient believes that 
his only way of obtaining release from the 
hospital lies in convincing the staff of the 
justification of his position and attitude. An 
admission of inadequacy, failure, or guilt on 
his part may seem damaging to his case. 
Hence his reluctance to voice sentences ex- 
pressing these feelings, which seem to him, 
through loss of distance, to have personal 
relevance. The melancholics, on the other 
hand, are anxious to impress one with their 
wickedness, and, indeed, are often worried 
that the staff may disbelieve their protesta- 
tions of unworthiness, failure, and guilt. 
Hence their choice of guilt sentences. A ta- 
chistoscopic presentation, to facilitate a closer 
examination of perception and report with re- 
gard to this test, is indicated. . 

Little can be said of the Tapping test be- 
yond the observation that the group express- 
ing least hostility had lower tapping scores 
as measured by the number of template 
squares entered. 

An explanation of this finding might be 
offered, perhaps, in terms of the psychomotor 
slowing of depressed patients, rather than of 
hostility. An elaboration of the scoring of this 
test might be helpful in clarifying this point, 
since the clinical impression in administering 
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the test was that force rather than speed had 
more bearing on the results. Some means of 
recording the number of taps and the pres- 
sure of tapping would be a useful addition. 


Summary 


Seventeen melancholic patients and 14 
paranoid states were given a battery of psy- 
chological tests purporting to measure the 
expression of some aspects of hostility and 
guilt at various levels of response, from an 
overt, direct expression of attitude to a covert, 
indirect one. Consistent differences were found 
between the groups at all levels. A number 
of problems worthy of further investigation 
have been discussed. 
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THEORY AND MEASUREMENT OF SOCIALIZATION * 


HARRISON G. GOUGH 


University of California, Berkeley 


A sociological conception ‘of a continuum 
of socialization, running from persons of ex- 
emplary probity and rectitude at one end, 
through persons of more typical and less 
beneficent coadunations of positive and nega- 
tive propensities, to persons of frankly errant 
and wayward impulse can easily be deline- 
ated. The precise location of a particular in- 
dividual or group along this continuum may 
pose something of a problem, but the existen- 
tial nature of the continuum itself and the 
fact that persons can and do make ready and 
reliable judgments in reference to it cannot 
be gainsaid. 

An adequate theory of socialization must 
pay attention to this continuum and be in 
consonance with it. This admonition also ap- 
plies to any procedures of measurement em- 
ployed in behalf of the theory. That is, a scale 
of measurement for “socialization” should po- 
sition individuals in the “asocial,” “normal,” 
and “‘supernormal” zones of the continuum in 
general accordance with the verdict which the 
sociocultural environment has handed down 
concerning them. 

If the scale of measurement is psychologi- 
cal, appealing to factors of individual re- 
sponse and calibrated on such reactions, then 
the problem is a psychosociological one, as 
it were, requiring the scaling of the psycho- 
logical dimension in such a way as to covary 
with the sociological one. Discrepancies are 
of course to be expected in individual in- 
stances between the sociological baseline and 
the psychological measurement, if for no other 
reason than that the culture will occasionally 


1 Earlier work on this program of research was 
conducted under a series of grants from the National 
Institute of Mental Health, U. S. Public Health 
Service. Since 1953 the program has been supported 
by two basic research grants awarded by the Ford 
Foundation 


make mistakes in putting some men in prisons 
and others in positions of trust and responsi- 
bility. One of the benefits to be derived from 
a psychological method for assessing the so- 
cialization continuum is that errors of this 
kind can be identified. 

For a number of years the author has been 
concerned with a theoretical formulation of 
the socialization problem which attempts to 
do justice to this notion of the continuous 
nature of the function (Gough: 1948, 1954; 
Gough & Peterson, 1952). Perhaps a brief 
summary of the theoretical position would be 
in order here. Socialized behavior is behavior 
based on a proper viewing of the self as a so- 
cial object, where the terms self and social ob- 
ject are used in the sense defined by Mead 
(1934). That part of the personality which 
links an individual to the social community 
is the “self.” The sense of self, or view of 
self, is a product of social interaction and of 
the capacity of the individual to view him- 
self as an object; that is, from the standpoint 
of the other. The self, therefore, has its roots 
in role taking, in a developing objectitication 
of the critiques and evaluations of oneself. In 
early life these views of self (called “me's” 
by Mead) may be discrete and unintegrated, 
but in time a certain communality and con- 
sistency in the patterns permits the evolve- 
ment 4f a conception of the “generalized 
other” which represents social reality as wit- 
nessed by the self. 

Such a process of introjection of societal 
standards is always coupled with a degree of 
uncertainty or independence in the expression 
of the self. This unpredictable element in the 
self is called the “I” by Mead. On the other 
hand, social adaptability, cooperation, and 
socialization itself are functions of the “me’s,” 
that is, of the role taking experiences and the 
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role taking capacities. Thus the degree to 
which the person will be able to govern in- 
ternally his thought and behavior in accord- 
ance with the imperatives of his culture will 
be a consequence of the depth and validity 
of the role taking experiences that he has 
enacted. 

In‘an earlier paper (Gough, 1948) the au- 
thor tried to show how this theoretical posi- 
tion could be brought to bear on the problem 
of psychopathy, and how the various symp- 
tomatic expressions of psychopathy could be 
deduced from the fundamental propositions of 
role theory. Later (Gough & Peterson, 1952) 
an attempt was made to develop a measuring 
instrument for asocial behavior using ques- 
tionnaire materials derived on thé basis of 
the role theory and focusing on the domain 
of social interaction and role taking experi- 
ences. Diagnostic items pertaining to the self 
view and role psychology were assembled and 
then administered to experimental samples, 
male and female. These included high school 
students, nondelinquent disciplinary problems 
in high school, and institutionalized delin- 
quents. Statistical analysis permitted the 
identification of 64 items (from some 200 
evaluated) having significant correlations with 
the socialization criterion. 

These 64 items were grouped into a scale 
which was then cross-validated on a military 
sample of 1,092 inductees vs. 99 stockade 
prisoners, and a second military sample in 
which 144 prisoners with two or more offenses 
were compared with 209 first offenders. The 
difference between the mean of the inductees 
and that of the prisoners was 7.26, critical 
ratio 11.52. For the first offenders vs. reci- 
divists the difference between means was 2.45, 
critical ratio 3.31. The signs in each instance 
were in the predicted direction, but even with 
use of the two-tailed test both critical ratios 
give probabilities well under .01. 

However, these two cross-validational cita- 
tions cover only two points on the socializa- 
tion continuum: delinquents vs. nondelin- 
quents and first offenders vs. repeaters. It 
must also be shown that socialized behavior 
can be brought under the purview of the 
theory and its scale of measurement to the 
same extent as asocial behavior. A scale or 
theory which would apply only to the distinc- 


tion between delinquents and nondelinquents 
would be a limited one indeed. The goal of 
the theory, on the contrary, is to encompass 
the full range of phenomena implied by the 
continuum and the goal of the scale is to lo- 
cate persons and groups in their proper places 
in any of its zones. 

To test the validity of the scale in making 
such differentiations, a greater variety of sam- 
ples was needed than reported in the 1952 
paper, especially samples in the “more so- 
cialized”’ regions of the continuum. Such sam- 
ples have gradually been obtained over the 
past seven years, in a sufficient number it 
would now seem to warrant analysis and re- 
porting. 


Shortening of the Scale 


As a first step, the 64-item scale developed 
in the 1952 study was item analyzed against 
the first offender vs. repeater dichotomy. Such 
an analysis follows logically from the theo- 
retical conception of socialization—asocializa- 
tion as a continuum, and from the contention 
that individual items as well as the full scale 
should possess validity at different regions of 
this continuum. Ten items were eliminated 
by the analysis, leaving 54 in the shortened 
scale.2 The following are representative of 
those retained (shown with the scored re- 
sponse for socialization). 


1. Before I do something I try to consider how 

my friends will react to it. (true) 

I often think about how I look and what im- 

pression I am making upon others. (true) 

I would rather go without something than ask 

for a favor. (false) 

4. I find it easy to “drop” or “break with” a 
friend. (false) 


Because the purpose of the scale is to position 
either individuals or groups along the basic 
underlying socialization continuum, it has 
been designated “So” for socialization. A defi- 
nition of the psychological implication of the 
scale can be given as follows: “To indicate 
the degree of social maturity, integrity, and 
rectitude which the individual has attained.” 


2 This 54-item version is the one included in the 
California Psychological Inventory (Gough, 1957). 
The CPI Manual gives a great deal of clinical and 
psychometric information about the scale 
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Table 1 


Means and Standard Deviations on the California Psychological Inventory So 
(Socialization) Scale for Male Samples Indicated 


More Socialized 


Sample 


Nominated high school 
“hest citizens” 
. Medical school 
applicants 
. Banking executives 


Regional wholesale flo 
salesmen 


. City school officials 


Jusiness executives 

College students 

. Civil Service suy 
personr ] 

. Electronic 


Correctional officers 


ery 


— 


ed workers 


hool students 


NUN 


. Social work graduate 
students 
Military officers 
Machine « 
Psychology graduat« 


students 


nerators 


Selective service 


inductees 


M: — M: = 8.76 


CR = 48.94; 


New Samples 


The present findings are based upon an en- 
tirely new series of 41 research samples, to- 
talling 1:295 male delinquents and 9,001 non- 
delinquents, and for females 784 delinquents 
and 9,776 controls. The samples cover a wide 
spectrum of the socialization continuum, rang- 
ing from nominated “best citizens” through 
various occupational and professional groups, 
through disciplinary samples, to known de- 
linquents and prison inmates. 

Table 1 lists the male samples tested and 
presents summary statistics on the 54-item So 
scale. The samples are ranked by mean score 
on So, with “more socialized” or nondelin- 
quent groups on the left, “less socialized” or 


Less Socialized 


Sample 


. High school disciplinary 
problems 
. County jail inmates 


. Prison inmates, New 
York 
. Young delinquents, 
California 
. Prison inmates, 
California 
Training school inmates, 
New York 
Inmates, federal 
reformatory 


73. 


delinquent on the right. The highest mean 
score among the 25 samples is observed in 
that of the nominated best citizens, and the 
lowest in the sample of federal reformatory 
inmates. In general, the rank ordering of sam- 
ples by the So scale seems to accord quite 
well with what would occur if they were 
ranked sociologically for socialization. It 
should be noted that all seven of the sam- 
ples carrying some explicit designation of 
asocial behavior score below the mean of the 
lowest scoring of those samples not so desig- 
nated. Comparison of the 9,001 cases in the 
more socialized group of samples with the 
1,295 cases in the less socialized group gives 
a difference between the means of 8.76, criti- 
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cal ratio 48.94, P < .001. The biserial corre- 
lation with the So scale for this same di- 
chotomy is +.73. 

Quite clearly, the So scale locates male 
samples along the socialization continuum in 
the way required by the underlying theory. 
The question now is, can this same differen- 
tiation be demonstrated for women? Table 2 
presents summary statistics on the CPI So 
scale for 15 female samples. The progression 
of mean scores for the female samples in 
Table 2 shows the same correspondence to the 
underlying socialization continuum as was 
previously observed for the male samples. 
There is again a perfect separation between 
the nine samples in the more socialized col- 
umn and the seven less socialized samples 
manifestly designated, in one way or another, 
for some defection from the socialization 
norms. For both the male and female sam- 
ples the validity of the specific placement of 
a sample on the socialization continuum by 
the average So score might be disputed, but 
the general correspondence between the psy- 
chometric ordering and the sociological hier- 
archy is remarkable. The statistical tests are 
also significant for the female samples, with 
a difference between the means of the more 


More Socialized 


Sample N M SD 

1. Nominated high school 90 41.51 4.55 
“best citizens” 
2. High school students 5295 39.69 5.55 
3. College students 3452 39.37 5.05 
4. Factory workers 291 38.99 4.76 
5. Psychiatric aides 67 38.70 3.91 
. Nurses 


7. Airline hostesses 60 38.07 4.51 

8. Social work graduate 320 37.99 4 38 
students 

9. Psychology graduate 59 36.44 3.93 


students 


Total 


9776 


39.46 


Table 2 
Means and Standard Deviations on the California Psychological Inven 
Socialization) Scale for the Female Samples Indicated 


vs. less socialized cases of 9.52, critical ratio 
37.83 (P < .001), and a biserial r of +.78. 

If one is willing to accept these results as 
indicating the validity of the So scale as a 
measure of socialization, some interesting ob- 
servations can be made about the relative 
standings of the groups tested. For example, 
there is the bemusing finding that psycholo- 
gists (University of California, Berkeley, psy- 
chologists, it might be cautioned) tend to 
rank rather low on the socialization con- 
tinuum, although fortunately not quite so low 
as to fall into the outrightly troublesome re- 
gion of the dimension. The sample of unmar- 
ried mothers stands midway in socialization 
between high school disciplinary problems 
and county jail inmates. In both male and fe- 
male rankings incarcerated persons score at 
the low end of the scale. It should also be 
noted that female samples in nearly all in- 
stances score significantly higher than their 
male counterparts. 

The grouping of cases in Tables 1 and 2 
into more socialized and less socialized is 
maintained in Table 3, where percentage fre- 
quency distributions are presented. Some in- 
dication of possible cutting scores and their 
screening efficiencies for the usual delinquent 


Less Socialized 


Sample N | SD 

1. High school disciplinary 87 34.79 7.00 
problems 

2. Unmarried mothers 213 32.92 6.24 

3. County jail inmates 31 29.61 5.86 

4. Prison inmates, Indiana 27 28.37 6.24 

5. Prison inmates, Cal 135 28.36 5.68 

fornia 
Prison inmates, Wis 76 26.83 7.04 


consin 
7 Young deli: 95 25.83 13 


California 


784 


29.94 
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Table 3 
Proportion of Cases Scoring at or above the Indicated 
Scores of the California Psychological Inventory So 
(Socialization) Scale for Ss Classified as More 
Socialized and Less Socialized 


Males Females 


More 
So Scale Socialized 


Score N = 9001 


More 
Socialized 


Less Less 


Socialized Socialized 


N = 1295 


N =9776 N = 784 


42 40 
4) 
40 


vs. nondelinquent dichotomy can be gained 
from this table. 

However, one might still ask whether any 
practical validity of the So scale for making 
the two-category classification in mass testing 
might not be vitiated by a base rate limita- 
tion. That is, if Category A (delinquency) is 
quite rare, the rules of inverse probability 
might make it more valid (accurate) in the 
long run simply to predict that everyone 
tested will be non-A. As Cureton (1957) has 
shown, this problem can be handled by proper 
setting of cutting scores so as to allow for 
base rate asymmetries. The optimal cutting 
score for predicting a dichotomous criterion 
(delinquent vs. nondelinquent here) from a 
continuous variable (the So scale) is the point 
of intersection of the smoothed frequency dis- 
tributions of the two classes on the predictor 
variable, where the areas of the two distribu- 
tions are proportional to the base rates. 

The question in our case is what base rate 
frequencies to use? As an example, suppose 
it is assumed that the true incidence of de- 
linquent personalities is 10% of the popula- 
tion. Using Cureton’s method for smoothing 
the distribution curves for the four samples 
in Table 3 and checking for the defined in- 


tersections leads to the specification of a cut- 
ting score of 23 (and below) for delinquency 
in males, and of 26 (and below) in females. 
If the true incidence of individuals sufficiently 
asocial to be classed as delinquent personali- 
ties is at least 1 in 10, then diagnostic classi- 
fications of all men with scores of 23 and be- 
low as delinquent types, and as nondelinquent 
types those with scores of 24 and above, will 
be more accurate than the 90% accuracy 
level which would automatically follow from 
classifying everyone tested as a nondelinquent. 

With a more misanthropic view that delin- 
quent personalities in reality constitute as 
much as 20% of the population, the optimum 
So cutting score for males is at 25 or below 
for delinquency. For females, the one in five 
assumption determines a cutting score of 28 
(and below). 

Perhaps two comments should be inter- 
jected before closing this section. The first is 
that the possibility of deriving dichotomies of 
the above type does not in any way alter the 
fundamental theory of the scale and of its 
psychosociology as representing a continuous 
socialization function. The second is that 
specifications for optimum use of the scale 
in other categorization problems could just as 
easily be made, for example, for super-citizens 
vs. all others. 


Work of Others 


A number of other investigators have car- 
ried out studies using the CPI So scale which 
help to illuminate the relationship between 
the measure and the theoretical continuum of 
socialization hypothesized to underlie it. One 
of these studies is that by Clark Vincent of 
unwed mothers.* The So scale data were gath- 
ered for 232 subjects (Ss), tested in several 
public and private maternal care centers. For 
this sample the mean So score was 32.25, SD 
6.58. These figures closely approximate those 
for the smaller sample of 213 reported in 
Table 2 earlier. 

The theoretical rationale for the So scale, 
however, requires a more detailed analysis of 
this sample, an analysis attentive to the gra- 
dations of the continuum. In the present 

8 Permission to use these data was very kindly 
given by Vincent, Univer. of California, in his book, 
Unwed Mothers, to be published. 
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instance socialization gradations can be re- 
flected by the number of pregnancies of each, 
S. Classification on this basis gave the follow- 
ing So results: 


Group N M SD 
1. One child 201 «33.60 5.65 
2. Two children 18 24.39 5.98 
3. Three or more children 13 22.23 4.16 


The progression of means here is that speci- 
fied by the theory of the scale, and is sup- 
ported statistically by the significance of the 
F test over the three means. Groups 2 and 3 
can be combined into a sample of 31 Ss hav- 
ing had two or more illegitimate children. 
The biserial correlation for this sample vs. the 
201 females having only one child is +.83. 
This coefficient offers rather striking evidence 
of the power of the So scale to discriminate 
within a certain range of the continuum. 

The mean scores listed above can also be 
reviewed for hypotheses about the proper 
location of each sample on the socialization 
continuum. The Ss having one child score at 
about the same point as high school discipli- 
nary problems; Ss having two or more chil- 
dren fall into a distinctly lower region of the 
dimension, into the range populated by the 
delinquent and criminal samples. 

The next study to be considered is that of 
Donald (1955). He administered the So scale 
to 230 consecutively admitted federal re- 
formatory inmates. The mean of 27.77 (SD 
6.53) places the total sample in its proper 
position on the continuum. The question for 
analysis is again that of differentiation within 
the total sample..Five of Donald’s compari- 
sons can be reported. The first is one in which 
socialization theory would not predict a dif- 
ference, a comparison between white and Ne- 
gro inmates. This expectation was confirmed, 
as the two part-samples did not differ signifi- 
cantly. The mean for 56 Negro inmates was 
29.39, for 174 white inmates 27.75; however, 
the ¢ test of the difference was not significant. 


A second of Donald’s comparisons was be- 


tween 135 Dyer Act commitments, M = 25.96, 
SD = 6.14, and the remaining 95 cases, M = 
30.34, SD = 6.19. Dyer Act offenses are in 
general more serious, and this difference is 
therefore in the predicted direction; the ¢ 
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ratio of 5.3 is significant well beyond the .01 
level. 

Comparison 3 pitted 111 boys with zero or 
one previous commitment vs. 119 boys with 
two or more. The means and standard devia- 
tions were 29.72 and 6.27 vs. 25.95 and 6.23. 
The difference is in the expected direction, 
and the ¢ ratio of 4.5 is again statistically 
significant. 

The fourth contrast was between boys 
whose first commitment occurred at age 15 or 
before vs. those whose first commitment came 
at age 16 or later. Socialization theory would 
predict lower So scores for the former group. 

Findings were in accord with these expecta- 
tions; for the 96 boys in the first group M 
24.76, SD = 5.89, and for the 134 in the 
second M = 29.93, SD = 6.09. The ¢ ratio 
for this difference was 6.4, P < .01. 

The last comparison is a novel one, con- 
trasting boys committed for ‘‘moonshining” 
infractions vs. all others. Federal liquor law 
violations lead to federal institution commit- 
ment, but from a socialization standpoint 
seem more akin to troublesome or moderately 
‘wayward behavior than to the asocial nature 
of most felonies. Therefore, one would expect 
higher So scores for this special sample. Such 
indeed was the case, for the 18 “moonshiners” 
had a mean score of 32.44, SD 6.99, and the 
212 remaining inmates had a mean of 27.37, 
SD 6.33. This difference is also statistically 
significant, the ¢ ratio being 2.9. The absolute 
level of the So mean, 32.44, should also be 
noted. It would rank the “moonshiners” just 
above “high school disciplinary problems” on 
the socialization hierarchy (see Table 1). 

The third study to be mentioned is that of 
the Pilot Intensive Counseling Organization 
Project (1956) being conducted at the Deuel 
Vocational Institution in California under the 
auspices of the California Youth Authority 
and the California State Department of Cor- 
rections.* In the PICO project, youth au- 
thority wards are assigned on a random basis 
at intake to experimental and control samples, 
the experimental cases participating in an in- 
tensive counseling program. The aim of the 

*The data reported here were very generously 


made available by Sherwood Blair, PICO project 
supervisor, and Alvin Rudoff, research coordinator. 
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project is to discover the effects of this coun- 
seling program, hoping to find that the ex- 
perimental cases will show better parole and 
postinstitutional records. The data reported 
here are all drawn from cases in the control 
sample. 

The CPI is administered to all Ss in the 
study at the time of admission to the training 
school. From the project records three sam- 
ples were established for our use: (1) parolees 
receiving suspensions during the first four 
months of parole; (2) parolees suspended 
during the fifth or a later month; and (3) 
parolees still free of suspension after five or 
more months on parole. The presumed sociali- 
zation gradient is from lowest (1) to highest 
(3). CPI So scale statistics for these samples 
are as follows: 


Sample N } SD 
1. Suspended within four months 90 5.10 
2. Suspended after four months 90 28.18 5.36 
3. Five or more months of parole 226 5 
without suspension 


78 


The progression of mean scores is in the di- 
rection specified, but the F test falls short of 
statistical significance. The follow-up period 
in the PICO project was not far enough along 
at the time of this writing to permit desig- 
nation of a “successful discharge from parole” 
sample; the third group above is only an ap- 
proximation of such a sample. If comparison 
is limited to early suspensions on parole 
(Sample 1) vs. later suspensions on parole 
(Sample 2), a critical ratio of 1.18 for the 
difference between the two means is obtained. 
Using the one-tailed test of significance, ap- 
propriate here because the difference is in the 
predicted direction, the probability level is 
between .05 and .10. 

The fourth reference to the work of other 
investigators concerns the program of studies 
being conducted by Reckless and Dinitz and 
their associates at the Ohio State University 
(Dinitz, Kay, & Reckless, 1957; Reckless, 
Dinitz, & Kay, 1957; Reckless, Dinitz, & 
Murray, 1957). In the Reckless, Dinitz, and 
Murray study, 125 boys living in high de- 
linquency areas in Columbus, Ohio, were 
nominated by teachers as being “insulated” 
against delinquency. These nominations were 
confirmed by a brief social service check of 


school and police records. The mean So scale 
score for this sample was 39.43, SD 6.42. 
Only one sample in Table 1, the high school 
best citizens, had a higher average score. In 
another study (Reckless, Dinitz, & Kay, 
1957), these 125 boys were compared with 
101 boys from the same high delinquency 
areas nominated by their teachers as being 
delinquency prone. The So scale statistics for 
this second sample were M = 31.40, SD= 
7.99. The difference between this and the 
earlier mean was highly significant. 

The paper by Reckless, Dinitz, and Mur- 
ray (1957, p. 568) also contains a comment 
bearing directly on the view of socialization 
as a continuous dimension and on the ca- 
pacity of the So scale to differentiate within 
regions of this dimension: 


The delinquency vulnerability and social responsi- 
bility scales did more than differentiate between the 
potentially delinquent and non-delinquent nominees. 
The scales also discriminated significantly, within 
the sample of potential delinquents, between those 
who had and those who had not experienced previ- 
ous police and court contact. 


The “delinquency vulnerability” scale referred 
to in the quotation is the CPI So scale dis- 
cussed throughout the present paper. The 
“social responsibility” scale is another one of 
the scales included in the full set of 18 in the 
California Psychological Inventory. 


Summary 


Arguments in behalf of viewing socializa- 
tion as a continuous dimension rather than 
as merely a dichotomy of social vs. asocial 
behavior were presented. A theory of sociali- 
zation was summarized, and a method of 
psychological measurement consonant with 
this theory was described. 

The greater portion of the paper was then 
taken up with findings seeking to demonstrate 
the validity of the measuring scale along the 
full range of the socialization continuum. 

Evidence from the writer’s inquiries as well 


5In the Ohio State study the scale was scored in 
the opposite direction, so as to emphasize the delin- 
quency vulnerability implications. The mean values 
reported in the present paper, however, have been 
reconverted to So scale form, so that higher scores 
are indicative of higher socialization levels. 
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as from those of other investigators gave 

strong support to the systematic validity of 

the scale. 
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A COMPARISON OF QUANTIFIABLE RORSCHACH 
ANXIETY INDICATORS IN HYPNOTICALLY 
INDUCED ANXIETY AND NORMAL 
STATES 


ZUGENE E. LEVITT 


Indiana University Medical Center 


HANUS J. GROSZ 
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Most Rorschachers believe that the Ror- 
schach test is a fairly sensitive indicator of 
anxiety in the subject (S). However, there is 
much disagreement on specific anxiety signs, 
with the possible exception of the shading re- 
sponse. Eichler (195la) listed 15 factors, 
which one or another expert alleged to be an 
indicator, but almost every Rorschach factor 
which can be quantified has been employed 
in this fashion in some investigation. The 
usual design of such a study is to contrast 
performances of groups of psychiatric pa- 
tients and normal individuals. In general, 
these studies have tended to produce equivo- 


cal and often conflicting results. Interstudy 


differences in administration and scoring tech- 
niques may have caused these discrepancies in 
part, but the most probable confounding fac- 
tor is sampling. There appears to be so many 
characteristics of the S which can influence 
Rorschach performance that the matching of 
patients and normals to eliminate bias is 
extremely difficult to accomplish. Successful 
matching is likely to be accomplished by pure 
accident, and the experimenter cannot be ex- 
pected to know when this has occurred. 

The most obvious way around the sam- 
pling problem is to use the same Ss for both 
the normal and anxiety performances. This 
introduces a new problem: how to induce 
anxiety in normal individuals. Many tech- 
niques have been used (cf. May, 1950) but 
their efficacies are usually questionable, if 
not downright dubious. 

The use of hypnotic suggestion to produce 


emotional states in normals has been reported 
periodically in the literature, and recently by 
Eichhorn and Tracktir (1955a, 1955b). The 
results suggest that this approach is rela- 
tively successful, and that it could be utilized 
to induce anxiety. 


Procedure 


The basic design of the study was to compare the 
Rorschach records of normal Ss as they would be 
obtained under ordinary circumstances with those 
obtained in an hypnotically induced anxiety state. 
In order to control for the effects of an hypnotic 
State itself on test performance, a record in hypno- 
sis without anxiety was also obtained. 

The Ss were 12 medical and nursing students, six 
males and six females. All Ss were volunteers and 
were paid for participation in the study. They had 
been selected from a pool of such Ss on the basis of 
two criteria: capacity to develop amnesia for perfor- 
mance in hypnosis, and general psychological adjust- 
ment. All Ss scored within usually accepted normal 
limits on the Taylor Anxiety Scale and the Barron 
Ego Strength Scale and were screened by psychiatric 
and medical examinations. 

The three records were obtained from each S on 
a single day in the following order: in the hypnotic 
condition, in hypnotically induced anxiety, and, 
finally, in the waking state. The S was made amnesic 
for the blots after each of the first two performances. 

Diffuse, nonspecific anxiety was induced in each S 
by a psychiatrist (H. J. Grosz) using a detailed sug- 
gestion in which the words “anxiety,” “fear,” “ap- 
prehension,” and “panic” were used. This use of a 
number of general synonyms for anxiety represents 
an effort to circumvent the effects of different pri- 
vate meanings of the single word “anxiety” among 
the Ss. The Ss were told that they did not know 
what was making them anxious. After the induction 
of anxiety, all Ss verbalized anxiety feelings and all 
showed some physical manifestations commonly as- 
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Table 1 
Factors Treated by Covariance Analysis 
(Influence of R Removed) 


Factor F ratio rwith R 
W 6.49** —.10 
Dd 1.02 
F (—-FM 2.88 ar 
F (+FM) 1.71 .79** 
F+ 5.42* 65° 
M 0.12 —.04 
FM 5.36* 23 
Sum C 0.51 34 
V 0.06 25 
Sp 0.88 29 
P 1.83 50* 
Tot H 2.05 .52* 
Tot A 0.49 62** 
Anat 0.45 
FY 1.83 73** 
FC 1.97 .28 
CF 1.65 24 
Hd 1.57 58** 

0.45 33 


Ad 


* Significant at the .05 level. 
** Significant at the .01 level. 


sociated with anxiety, such as trembling, agitation, 
pallor, etc. Ratings of the degree of anxiety were 
made independently by the two authors for each S, 
using a five-point scale on which a rating of one 
represented zero anxiety as in the waking state. The 
interrater agreement, as reflected in a tau coefficient 
of 47, is not high, but the raters did agree that 
every S showed some anxiety above what they had 
previously manifested. The mean. rating by the psy- 
chiatrist was 3.04, SD = .82. The psychologist’s mean 
rating was 2.92 with an SD of .83. 


Results 


A total of 25 Rorschach factors were ana- 
lyzed. For 20 of these, the treatment was a 
covariance analysis in which the influence of 
R, the number of responses, was held con- 
stant.! The F ratios among mean scores for 


1The use of a covariance analysis to control for 
the effects of R has been suggested by Eichler 
(1951b) as the most effective, economical method 
when W is small. Since the distributions of many 
Rorschach factors obtained from unselected samples 
as well as from many types of selected samples often 
are markedly skewed, a transformation is ordinarily 
required before applying the covariance approach. 
ae was not true in the present study. Most of the 
distributions tended to be relatively symmetrical 
i 


Adjusted Condition Means 


Waking Hypnosis Anxiety 
7.16 981 7.36 
2.12 1.85 2.71 
15.19 14.17 11.80 
17.89 18.45 16.23 
8.63 8.51 6.02 
3.60 3.41 3.64 
2.70 4.28 4.51 
2.52 2.23 4.05 
5.24 5.99 5.91 
1.73 1.79 1.96 
3.23 4.13 3.72 
6.39 6.77 5.52 
5.43 5.48 7.26 

10.04 11.23 10.73 
2.15 2.80 2.55 
3.83 3.75 4.91 
4.17 2.96 3.37 
3.03 4.12 3.43 
.* 2.11 2.90 
2.24 1.72 1.96 


the three conditions, the correlation with R, 
and the adjusted condition means for these 
20 factors appear in Table 1. Three factors— 
R, F+%, and median reaction time of first 
response—were treated by variance analyses; 
the data are in Table 2. There were several 


about a maximum ordinate near the center of the 
distribution. Means and medians coincided roughly 
A transformation was therefore not indicated. The 
primary explanation of the absence of skewness ap- 
parently lies in the relatively limited intelligence 
range in our sample. Most Rorschach variables, es- 
pecially determinants, show some relationship to in- 
tellectual factors; this is reflected in their correla- 
tions with R, as shown, for example, by Fiske and 
Baughman (1953). R is itself heavily dependent 
upon intelligence; as Beck (1954, p. 53) remarks, 
“the response total is one of the test’s best indices 
of intelligence in liberation.” The relatively high 
mean R in our group is a second explanation of the 
absence of skewness. This mean would fall at about 
the 70th percentile of the R distribution for either 
the normals or patients in the Fiske and Baughman 
study. What this means, simply, is that our Ss pro- 
duced more scores above 2 on various factors, and 
fewer of the scores of 0 and 1 which form an impor- 
tant causal component of the skewed distributions 
usually obtained with Rorschach factors. 
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factors whose incidence was too low and 
whose distributions were too skewed to per- 
mit covariance analyses. Two of these, M— 
and pure C, were components of broader 
factors which were found in the covariance 
analyses to be unrelated to R. These factors 
were analyzed by means of the Cochran Q 
Test (Siegel, 1956) for correlative propor- 
tions with more than two conditions. These 
results are shown in Table 3; the figures un- 
der the respective conditions show the num- 
ber of Ss whose records had at least one of 
the factors. 

There are significant differences among the 
three conditions on 6 of the 25 factors: W, 
F+, FM, Sum Y, median reaction time, and 
M—. The inferences are clear-cut with re- 
spect to F+, Sum Y, reaction time, and M—, 
since in each of these three conditions the 
significance is a function of a difference be- 
tween the anxiety state on one hand and the 
other two states on the other. In the case of 
W, the conclusion is that its incidence is fa- 
cilitated by hypnosis and is then depressed 
by the anxiety condition. We may conclude, 
then, that as far as the results of this study 
are concerned, these five indices reflect anx- 
iety. In the case of FM, the significance is 
due to a common increase in both hypnosis 
and anxiety, with no difference between the 
two. The inference must be that the increased 
incidence is a result of the hypnotic state it- 
self, and not of anxiety. It is of incidental in- 
terest that the F+ which is significant is not 
F+% in the conventional sense but F+ as 
a function of the total number of responses. 
The conventional F+%, which appears in 
Table 2, was not significant, though it is in 
the right direction. Of the 15 factors sug- 
gested as anxiety indices by Eichler (1951a), 


Table 2 


Factors Treated by Variance Analyses 


Condition Means 


Factor F ratio Waking Hypnosis Anxiety 


R 1.61 
F+% 
Mdn. RT (secs.) 


34.3 31.0 
1.09 89 87 
4.83* 24.4 18.0 


* Significant at .05 level. 


Table 3 


Factors Treated by Cochran ( Test 


Factor Incidence 
(N of Ss) 


Chi Square Waking Hypnosis Anxiety 


Factor 
8.40* 1 2 6 
3.43 1 3 5 


M- 
Pure C 


ificant at .05 level. 


12 appear in this study. Of these, two—W 
and Sum Y—significantly differentiated the 
anxiety state (interestingly enough, this was 
also Eichler’s result). Four others—R, Dd, 
Hd, and P—are in the right direction. The 
remainder—M, Sum C, Ad, Total A, anatomy 
responses, and F—are unpredictive. Of inter- 
est is the fact that F, as in the Eichler study, 
seems to be reduced rather than increased in 
anxiety. Also as in the Eichler study, this 
seems to be a function of the increase in 
shading responses. The data of Table 1 also 
show that it makes very little difference 
whether F is scored to include animal move- 
ment responses a la Beck, or without them as 
in the Klopfer system. However, the fact that 
FM seems to be affected by hypnosis suggests 
that it has some merit in the general analysis 
of personality. 

About half of the factors treated by co- 
variance analyses are significantly related to 
number of responses, despite the small num- 
ber of Ss. This is, of course, not unexpected. 
Fiske and Baughman (1953) reported that 
all factors were related to R in a large sam- 
ple of outpatients, and all but a few in the 
Spiegel normal sample on which Beck’s 
(1950) norms are based. The absence of cor- 
relation of many variables in our study is 
probably due to the attenuated range of R, 
which is itself due to the limited intelligence 
range in the sample. It is interesting to note 
that the correlation of .50 between R and 
number of populars is very close to the coeffi- 
cient of 48 reported by McCall and Doleys 
(1955) for the Spiegel sample. 

Even though our results are based on a 
small group of Ss, they may nevertheless be 
more revealing than those of larger scale, con- 
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trasting groups studies. Despite other limita- 
tions, there is no sampling bias involved in 
the selection of the “contrasting groups” in 
our investigation. Whether the use of indi- 
viduals of relatively high intelligence and 
hypnotic susceptibility restricts generaliza- 
tions from our results is a question requiring 
further study. Certainly the mean scores are 
not representative. This does not, however, 
mean that the change in mean scores from 
state to state might not be characteristic of 
a broader population. 


Summary 


Rorschach records were obtained from 12 
carefully selected, normal individuals in the 
waking state, in hypnosis, and in hypnoti- 
cally induced anxiety. Appropriate analyses 
of 25 quantifiable Rorschach factors indicate 
that decreases in W and F+, and increases 
in Sum Y, reaction time of first response, and 
incidence of M— reflect the anxiety state. An 
increase in FM was found to characterize the 
hypnotic state per se, but not the anxiety 
state. 
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THE SPIRAL AFTEREFFECT TEST (SAET) AS A 
PREDICTOR OF NORMAL AND ABNORMAL 


ELECTROENCEPHALOGRAPHIC RECORDS 


THEODORE H 


In the psychological assessment of cortical 
damage, the Spiral Aftereffect Test (SAET) 
has become an important and controversial 
technique. This technique was first introduced 
into the literature in 1949 by Freeman and 
Josey. Further pioneering of the SAET was 
conducted by Standlee (1953) and Price and 
Deabler (1955). 

The illusion, or aftereffect, following rota- 
tion of the Archimedes Spiral has long been 
known and used in experimental psychology 
(Boring, 1942). Research on the SAET is 
summarized in Table 1. 

It is noted that the mechanical technique 
used in presenting this test has varied consid- 
erably. Some research indicates high validity 
in differentiating subjects (Ss) with cortical 
damage from normals and functional states 
(Price & Deabler, 1955). Some research raises 
question as to the usefulness of this test in 
diagnosing brain damage (Gilberstadt, Schein, 
& Rosen, 1958). Researchers have had to con- 
struct their own apparatus, and no two stud- 
ies have utilized identical apparatus and pres- 
entation. These factors, as well as differences 
in sampling technique and criteria, may ac- 
count for the variation in research results. 
For instance, in evaluating the apparatus 
over a considerable period of time, it was dis- 
covered'that any deviation of the motor shaft 
caused ambiguous aftereffect perception. 

The importance of base rates in predicting 
valid measurements of cortical damage using 


1The authors wish to. acknowledge the consider 
able part played in this study by Arturo G. Gonzalez, 
whose skill and interest made the detailed criterion 
data possible. 

2 Now in private practice, Tampa, Florida. 
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the SAET has been reported in recent litera- 
ture (Stilson, Gynther, & Gertz, 1957; Gil- 
berstadt et al., 1958). 

Berger, Everson, Rutledge, and Koskoff 
(1958) conducted an elaborate study using 
extensive clinical criteria. Results indicated 
negative correlation when the SAET was com- 
pared with EEG records, pneumoencephalo- 
grams, and skull X-rays. Three studies are 
reported using children as the experimental 
sample (Spivack & Levine, 1957; Davids, 
Goldenberg, & Laufer, 1957; Harding, Glass- 
man, & Helz, 1957). The results seem to in- 
dicate the usefulness of the SAET in the 
measurement of cortical involvement in these 
children. 

Of 16 studies reported, 10 have used four 
trials, and the remainder, from three to six 
trials in presenting the revolving spiral. Yates 
(1954), in his evaluation of research measur- 
ing cortical damage, indicates this area has 
been studied most inadequately. He refers to 
such criticism as: lack of comparable groups 
for validation studies, classification of corti- 
cal damage, the control of relevant factors 
such as age, sex, intelligence, subjective scor- 
ing systems, and adequate control of the test- 
ing procedures. 

The present study gave particular atten- 
tion to the above criticisms. 


Procedure 
Subjects 


Four hundred and twenty children between the 
ages of 5 and 16 years were examined during a pe- 
riod of two-and-a-half years at an outpatient psy- 
chological clinic. The variety of complaints and dis- 
orders ranged from requests for a general psycho- 
logical checkup to referrals for differential diagnosis 
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of what was tentatively described as childhood schizo- 
phrenia. All of these Ss were given various psycho- 
logical tests, but in each case a SAET of eight trials 
was administered. Figure 1 presents graphically the 
distribution of SAET scores for the cases. 

Experimental group. Fifty of the children shown 
in Fig. 1 were selected as the “predicted abnormal” 
group. These children did not perceive the aftereffect 
phenomenon on eight trials of the SAET. Four of 
the original sample were dropped from the study be- 
cause EEG evaluation was not obtainable. This re- 
sulted in an experimental group of 46 cases. The 
mean age in months for the predicted abnormal 
group was 122.98. There were 34 males and 12 fe- 
males included in the group. The average grade place- 
ment was at the 4.0 grade level. Included in the 
group were 37 cases where one of the initial com- 
plaints was poor reading ability, 13 cases where 
speech defect was part of the presenting picture, 20 
cases where the child was overly aggressive, and 22 
cases where the child was said to be withdrawn. 
Total complaints exceed 46 since several symptoms 
were frequently described for a single case. 

Control group. A matched control group was se- 
lected for this study. Twenty cases, matched for age, 
were selected from the sample shown in Fig. 1. All 
Ss in this group achieved a full score of eight on the 
SAET. Mean age for this group was 119.65 months, 
and the group included 11 males and 9 females. The 
average grade placement was at the 4.4 grade level 
Within this group were two children with severe 
reading problems, one with speech difficulty, three 
overly aggressive children, and one child who was 
said to be withdrawn. The remainder of the group 
were children who were seen for routine psychologi- 
cal evaluation. This group was designated the “pre- 
dicted normal group.” 
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Fig. 1. Distribution of Spiral Aftereffect test scores 
for 420 children referred for outpatient psychological 
services. 


Table 2 


Matching Data for the Experimental 
Group and Control Group 


Control mental 
Group Group 
(Predicted (Predicted 
Normal Abnormal 

EEG) EEG 


Variable N=2 N = 46 
Mean age in months 119.65 122.98 
Sex 
Males 11 24 
Females 9 12 


School placement, grade 44 40 


Table 2 presents the data for the experimental 
group and the control group. 


Predictor Battery 


Although the primary purpose of this study was 
the evaluation of the SAET, it was decided to in- 
clude some standard psychological instruments said 
to be related to the presence or absence of brain 
damage. 

The psychological tests selected for this study ar 
viewed as screening devices rather than diagnostic 
tools. It was hypothesized that psychological meas 
urements can discriminate children of essentially nor- 
mal cortical function from children having disrupted 
cortical function. 

The Spiral Afterefiect Test. The stimulus for this 
procedure was an Archimedes Spiral (920°), im 
printed on a 7-in. white disc, presented to Ss with 
a rotation speed of 80 rpm (5 rpm variability). The 
apparatus was battery operated and reversible (clock 
wise and counterclockwise rotation) .* 

Each S was seated approximately eight ft. from 
the spiral. The following instructions were given: 


This is a test of the way you sec. Please keep 
your eyes on the silver button in the center of the 
wheel. In a moment I will turn on the machine 
The wheel will go around and I will ask you what 
you see. Please keep your eyes on the silver but- 
ton even when I ask you questions. Do you under 
stand? 


At this point the machine was turned on. After 10 
sec., aS timed with a stop watch, the S was asked: 
“What is happening to the black line?™ In most 
cases the S responded with the appropriate impres 
sion. If a vague response was given, the S was asked 
“Ts the black line moving toward the center or away 


* Details of the apparatus are available from the 
Psychological Research and Development Corpora- 
tion, Tampa, Florida. 
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from the center?” In all cases, Ss were able to re- 
spond to this inquiry. At the end of 20 sec. the S 
was told: “In a moment I will turn the machine off. 
Please watch the wheel very carefully when I turn 
the machine off.” At the end of 30 sec. the apparatus 
was stopped and the S was asked: “What is now 
happening to the black line?” Any reported after- 
effect of opposite direction from the original effect 
was given a score of 1. A zero score was given if no 
aftereffect was reported. 

Eight trials were presented in an ABBABBAA 
order. 

Reliability of this test is well known (Freeman & 
Josey, 1949; Gallese, 1956; Price & Deabler, 1955) 
with test-retest and interjudge reliability having been 
found to be between .84 and .94 by various experi- 
menters. 

The Bender Visual Motor Gestalt Test. This well- 
known test has been described elsewhere (Bender, 
1938). Although commonly evaluated by clinical 
judgment, a five-point rating scale was developed for 
this study. Standard instructions were followed in 
the administration of this test. 

The five-point rating scale was applied as an “in- 
spection technique.” The judges examined the entire 
Bender record of each S and noted such discrepancies 
as distortion, collision (one figure running into an- 
other) and rotation (a figure turned from the nor- 
mal axis). Thus the test productions were rated in 
terms of their apparent equivalence to the original 
cards. Each S was given a score ranging between 0 
and 4, with O representing what would be consid- 
ered an essentially perfect production, with consid- 
eration for the age of the S (Bender, 1946). 

Two judges, each with several hours’ training on 
this technique, rated all of the cases in the study. 
Applying a raw data formula, a Pearson correlation 
of .77 was obtained. Thus, the reliability coefficient 
is significant well beyond the .01 level. A cutoff point 
was established on the rating scale whereby scores of 
0, 1, and 2 were considered to be normal and scores 
of 3 and 4 were considered to be abnormal. In com- 
paring the interjudge reliability using this method, 
agreement was obtained in 93% of the cases. 

Draw-A-Person Test. This was the standard clini- 
cal test in which the S was asked to “draw a per- 
son.” No further directions were given other than to 
encourage the § to complete the drawing. 

Each drawing was rated on the single variable of 
“tipping.” A perpendicular figure was considered to 
have a score of 0, while a “floating” figure, lying 
horizontal on the paper, was given a score of 4. 
Scores of 1 through 3 were assigned to various de- 
grees of rotation between the perpendicular and the 
horizontal. 

Two judges rated all of the drawings in the study. 
Applying the raw data formula to the judges’ rat- 
ings, a Pearson correlation of .75 was obtained. This 
is significant beyond the .01 level. A second reli- 
ability evaluation was done by setting a cutoff score 
between 2 and 3 on the five-point scale. Thus, scores 
of 0, 1, and 2 were considered to be predictive of 
normal cortical function. Scores of 3 and 4 were 
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considered to be predictive of abnormal cortical func- 
tion. Agreement was found on 85% of the ratings 
using this method. 

The Wechsler Scales. For the purposes of this 
study, several subtests of the Wechsler Intelligence 
Scale for Children (WISC) were administered to all 
of the Ss. The scales selected were, Digit Symbol, 
Block Design, Arithmetic, and Digit Span. An addi- 
tional Digits-Reversed Score was also included. These 
scales were administered and scored according to 
standard procedures. Except for the Digits-Reversed 
variable, weighted scores were used. 


Criterion 


Whereas all previous studies have dealt with known 
brain damaged groups or assumed non-brain-dam- 
aged groups, this study deals with a group of chil- 
dren who were supposedly non-brain-damaged, as a 
result of physical examination before coming to the 
outpatient clinic in which they were tested. Empiri- 
cal evidence indicates that about 10% of clinic pa- 
tients, even though physical examination is negative, 
have some cortical disruption which might be de- 
scribed as subclinical intracranial pathology. It was 
deemed necessary that this study would have an ob- 
jective neurologic evaluation, and the EEC was 
chosen as the preferred criterion measure. All mem- 
bers of the experimental and the control groups 
were referred for EEG evaluation. A two-hour EEG 
record was taken on each child. A standard eight- 
channel Medcraft instrument, CS7C Model D, was 
used. Tracings were obtained in all leads, and a por- 
tion of the recording was a sleep record 

Only one judge was used in this experiment, since 
in the locality of the study there was a neuropsy- 
chiatrist available who had extensive experience in 
EEG with children. Each child was rated by the 
neurologist on a five-point scale ranging from 0 to 
4, on the factor of focal findings vs. no focal find- 
ings. A second rating scale was administered to meas- 
ure diffuse findings vs. no diffuse findings. In deter- 
mining the final criterion, those cases which showed 
any EEG anomalies were called the “abnormal EEG 
group.” Those Ss whose EEGs resulted in no find- 
ings were considered to have normal cortical function 

Six months after the original study, the EEG trac- 
ings of 18 of the Ss, including both “predicted nor- 
mal” and “predicted abnormal,” were selected using 
a table of random numbers. These records were given 
to the original rater for blind re-evaluation. He was 
asked to rate each of these records in terms of find- 
ings vs. no findings. When these ratings were fin- 
ished, it was found that all 18 cases had been rerated 
exactly as the original ratings. Since an intrajudge 
correlation of 1.00 seemed highly unlikely, the re- 
sults were questioned. In order to insure a more 
careful evaluation of the reliability of the EEG rat- 
ings, 36 cases were selected for re-evaluation. Half of 
these were predicted normal and half were predicted 
abnormal. In all cases, identifying data were removed 
and the neuropsychiatrist rating. the cases re-evalu- 
ated the records with no idea of which were normal 
and which were abnormal. Between 6 and 18 months 
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Table 3 


Correlation of Various Predictors with EEG Criterion 
and Intercorrelation with Each Other 


Predictor Variable Spiral Bender DAP 


EEG (criterion) 
Spiral Aftereffect 84* 
Bender Gestalt 
Draw-A-Person 
WISC 

Digit Symbol 

Block Design 

Arithmetic 

Digit Span 

Digits Reversed 


* Phi coefficient. 
* Significant at .01% level or better (.01 level = .345; .05 
level = .166). 


had elapsed between the original ratings and this 
final rerating. 

The same criteria of findings vs. no findings was 
applied, and the resulting data yielded a phi coeffi- 
cient of .77. The neuropsychiatrist was able to re- 
rate the EEG tracings reliably in 87% of the cases. 
This sort of reliability is unusual in view of what 
has been found in the literature. It should be stated, 
however, that the EEG records taken on these chil- 
dren were more carefully done than one might ex- 
pect in the course of a usual EEG evaluation. The 
judge who rated this data had long experience in 
electroencephalography with children. A good deal 
of time was spent on the inspection of the records. 
It was determined that it would be impossible to 
evaluate predictors adequately if the criteria itself 
were unstable. 


Results 
The Spiral Aftereffect Test 


All Ss were given eight trials on the SAET, 
and scores of 6, 7, and 8 were considered to 
be predicted normal. A score of 0 through 5 
was considered to be predicted abnormal. 
This dichotomy was compared with the EEG 
ratings of findings vs. no findings. 


The Bender Visual-Motor Control Test 


Scores on the five-point scale of the Bender 
were separated into predicted normal (score 
of 0 or 1) and predicted abnormal (score of 
2, 3, or 4), and compared with the EEG 
criteria. 


The Draw-A-Person Test 


Scores on the five-point scale of the Draw- 
A-Person test were dichotomized: predicted 


normal (score of 0 or 1), and predicted ab- 
normal (score of 2, 3, or 4), and compared 
with the EEG criteria. 


The Wechsler Scales 


The raw scores on the Wechsler Scales were 
compared with the EEG criterion of findings 
vs. no findings using the point biserial corre- 
lation technique. 

Intercorrelations were calculated among the 
various predictor variables. Table 3 presents 
the correlations among the various predictors 
and the EEG criterion. Unless otherise indi- 
cated, point biserial correlation technique was 
used. 

It will be noted on Table 3 that the SAET 
had the highest correlation with the EEG cri- 
terion. The Bender, the Draw-A-Person test, 
the WISC Arithmetic, and the WISC Block 
Design all correlated beyond the .01 level of 
significance with the EEG criteria. 

Significant intercorrelations were found be- 
tween the SAET and the Bender, between the 
SAET and the Draw-A-Person test, between 
the SAET and the WISC Digit Symbol, and 
between the SAET and the WISC Arithmetic. 

The Bender correlated significantly, with 
the SAET, the Draw-A-Person test, the WISC 
Digit Symbol, and the WISC Block Design. 

The Draw-A-Person test correlated signifi- 
cantly with the SAET and the Bender. 

Table 4 presents the percentage of “hits” 
in predicting normal EEG records and abnor- 


Table 4 


Accuracy of Prediction of Normal“and Abnormal EEG 
Records by the Various Predictor Variable, in 
Percentage Hits 


Percentage of Percentage of 


Normal EEG 
Records 
Correctly 
Predicted 
(Hits) 


Abnormal 
EEG Records 
Correctly 
Predicted 
Predictor Variable (Hits 
Spiral Aftereffect test 1. 86 
Bender Gestalt F 70 
Draw-A-Person 17 
Digit Symbol Z 73 
Block Design 
Arithmetic a 
Digit Span 81 
Digits Reversed a 72 
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mal EEG records for all the predictor vari- 
ables. 

An attempt was made to analyze the char- 
acteristics of the EEG records. This was 
found to be difficult in that the different rec- 
ords were described with such a variety of 
terminology that statistical analysis was ques- 
tionable. Eight characteristics of the records 
were chosen in order to present some defini- 
tive data.* Table 5 presents this brief analy- 
sis of the EEG records. 


Discussion 


The data presented above would seem to 
indicate that the SAET and other psycho- 
logical tests are variously effective for dis- 
criminating children exhibiting abnormal elec- 
troencephalographic records from children 
who exhibit normal records. These findings 
support some of the earlier work done with 
the SAET. The SAET seemed to be particu- 
larly effective as a discriminating technique. 
The sample was considered normal by gen- 
eral physical examination. This raises the 
hypothesis that psychological tests are per- 
haps more sensitive indicators of minimal 
cortical disruption than the usual medical- 
neurological examinations. 

The present study differs in its findings 
from some of the previous research on the 
SAET, particularly one study which used 
EEG as the criterion. The EEG record must 
be considered an intermediate criterion rather 
than an ultimate criterion. There are a va- 
riety of ways and means by which the EEG 
may be used as a validating instrument. In 
the present study, it was used with the con- 
siderable care, and the reliability data pre- 
sented lends some support to this technique 
as an adequate criterion. The fact that it is 
not an ultimate criterion is evidenced by the 
fact that one of the children whose EEG rec- 
ord was rated “no findings” had a grandmal 
seizure in the clinic. It is conceivable, in view 
of suspected criterion fallibility and less than 
perfect intrajudge EEG reliability, that the 
predictability of the SAET as well as other 
psychological measurements may actually be 
better than the present data indicates. Since, 


8 Analysis of the EEG records continues and will 
be presented in a later paper. 


Table 5 


A Brief Analysis of Some Characteristics of the EEG 
Records of the Predicted Normal and the Predicted 
Abnormal Groups 


Predicted Predicted 
Normal Abnormal 
Group Group 


Mean occipital alpha rhythm : 8.2 
(per second) 


Mean rating of paroxysmal 2.06 


activity (rated 1-4) 


Mean rating of build-up under 2.31 
hyperventilation (rated 


1-4) 


Mean rating of dysrhythmia or 
disorganization (rated 1-4) 


Percentage of records with 
focal slow activity 


Percentage of records with 
spikes 
Single 
Multiple 
Spike & Dome 


however, we must be dependent upon an in- 
termediate criterion, all we can do is attempt 
to establish the criterion used with the great- 
est of care and attention to questions of reli- 
ability and validity. 

The most significant question in the evalua- 
tion of the SAET, as well as any other diag- 
nostic technique, is the question of base rate. 
This matter has been previously discussed in 
the introduction to this study. It was decided 
not to apply the base rate formula to the, 
present data since there is no information, 
available of random samplings of unselected 
EEG tracings in children between the ages of 
5 and 16 years. This would seem to be a very 
necessary area for further research. 

In judging the effectiveness of any instru- 
ment, one must consider its application. Psy- 
chological tests, when considered in terms of 
the presence or absence of brain damage, are 
screening devices. In and of themselves they 
cannot be diagnostic. The diagnosis of the 
presence of brain damage would be a medi- 
cal problem, whether it be through labora- 
tory tests, histological examination, or re- 
sults of treatment. In the present study, the 
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SAET was the best of a series of psychologi- 
cal tests for the prediction of EEG findings. 
Other psychological tests were significant but 
less accurate in their predictability. Undoubt- 
edly we will always be dealing with the ques- 
tion of false positives and false negatives. It 
is still a moot question whether errors in pre- 
diction are based on inaccuracies of the pre- 
dictor variables, or in the criterion itself. 
Much careful research is necessary to clarify 
these points. For the time being, however, 
since clinical psychologists are called upon to 
give tests and, as part of their testing pro- 
cedure, to make some determination of who 
shall be referred to the neurologist and who 
shall not be referred, the SAET, probably in 
combination with one or more standard psy- 
chological instruments, would seem to be the 
best available technique for this purpose. 


REFERENCES 


Aaronson, B. S. Age, intelligence, aphasia, and the 
spiral aftereffect in an epileptic population. J. clin. 
Psychol., 1958, 14, 18-21. 

Benper, L. A visual motor gestalt test and its clini- 
cal use. New York: American Orthopsychiat. Assn, 
1938. Pp. 176. 

Benver, L. Instructions for the use of visual motor 
gestalt test. New York: American Orthopsychiat. 
Assn., 1946. Pp. 4-5. - 

Bercer, D., Everson, R., Ruttepce, L., & Kosxorr, 
Y. D. The spiral aftereffect in a neurological set- 
ting. J. consult. Psychol., 1958, 22, 249-255 

Borinc, E. G. Sensation and perception in the his- 
tory of experimental psychology. New York: Ap- 
pleton-Century, 1942. Pp. 591-592. 

Davins, A., Gotpenserc, L., & Laurer, M. W. The 
relation of the Archimedes spiral aftereffect and 
the trail making test to brain damage in children. 
J. consult. Psychol., 1957, 21, 429-433. 


Freeman, E., & Josey, W. E. Quantitative visual in- 
dex to memory impairment. AMA Arch. Neurol. 
Psychiat., 1949, 62, 794-797. 

Gattese, A. J., Jx. Spiral aftereffect as a test of or- 
ganic brain damage. J. clin. Psychol., 1956, 12, 
254-258. 

Garrett, E. S., Pricer, A. C., & Deasirer, H. L. 
Diagnostic testing for cortical brain impairment. 
AMA Arch. Neurol. Psychiat., 1957, 77, 223-225. 

H., Scuem, J. D., & Rosen, A. Fur- 
ther evaluation of the Archimedes spiral after- 
effect. J. consult. Psychol., 1958, 22, 243-248. 

Govpperc, L. R., & Smitn, P. A. The clinical use- 
fulness of the Archimedes spiral in the diagnosis 
of organic brain damage. J. consult. Psychol., 
1958, 22, 153-157. 

Harpinc, G. F., Grassman, S., & Herz, W. C. Ma- 
turation and the spiral aftereffect. J. abnorm. soc. 
Psychol., 1957, 54, 276-277. 

Lonpon, P., & Bryan, J. H. The influence of in- 
structions on spiral aftereffect reports. Amer. Psy- 
chologist, 1958, 13, 7. 

Pace, H. A., Raxita, G., Kapran, H. K., & Smita, 
N. B. Another application of the spiral aftereffect 
in the determination of brain damage. J. consult. 
Psychol., 1957, 21, 89-91. 

Price, C. A. & Deaster, H. L. Diagnosis of or- 
ganicity by means of spiral aftereffect. J. consult. 
Psychol., 1955, 19, 299-302. 

Sprivack, G., & Levine, M. The spiral aftereffect and 
reversible figures as measures of brain damage and 
memory. J. Pers., 1957, 25, 767-778. (Mimeo.) 

Sprvack, G., & Levine, M. Spiral aftereffect, pa- 
rameters and correlates. Unpublished manuscript, 
Devereux Foundation for Res. & Trng., 1958. 

STANDLEE, L. S. The Archimedes negative aftereffect 
as an indication of memory impairment. J. con- 
sult. Psychol. 1953, 17, 317. 

Strrson, D. W., Gyntuer, M. D., & Gertz, B. Base 
rate and the Archimedes spiral illusion. J. con- 
sult. Psychol., 1957, 21, 435-437. 

Yates, A. C. The validity of some psychological 
tests of brain damage. Psychol. Bull. 1954, 51, 
359-379, 


(Received January 19, 1959) 


| 
| 
« 
ye 
‘ 
| 
| 
: 


Journal of Consulting Psychology 
Vol. 24, No.1, 1960 


A COMPARISON OF SHADING RESPONSES OBTAINED 
WITH TWO RORSCHACH METHODOLOGIES 
FROM PSYCHIATRIC AND NON- 
PSYCHIATRIC SUBJECTS’ 


PATRICIA F. WALLER * 


University of North Carolina 


Baughman has developed a new inquiry 
procedure for the Rorschach Ink Blot Test 
(Baughman: 1954, 1958). This procedure 
utilizes a paired comparison presentation of 
a series of modifications of the original blot. 
For example, if a subject (S) reported a 
flower garden on Card X, instead of the usual 
questioning on the inquiry, the S is presented 
the original card and an identical card in 
which the color has been eliminated. He is 
then asked if he can still see the flower gar- 
den in the achromatic card, in which card is 
it clearer, and why. The original card is then 
removed and replaced by a card in which the 
shading has been eliminated, and the ques- 
tioning proceeds as before. The cards con- 
tinue until there is only an outline of the blot 
remaining, but it is rarely necessary to con- 
tinue questioning throughout the entire series. 

It has been shown that this paired com- 
parison inquiry procedure elicits from non- 
psychiatric Ss many more shading responses 
than does the regular inquiry procedure 
(Baughman, in press; Marimon, in press). 
It appears that the questions involved in the 
paired comparison inquiry are easier to an- 
swer than those in the conventional inquiry. 
Rather than asking, “What is it about the 
blot that makes it look like a bat?” one asks, 
“Do you still see the bat here? Is it as clear? 
What makes it clearer over here?” Although 


1 This article is based upon a doctoral dissertation 
completed at the University of North Carolina, Au- 
gust, 1958. This paper was presented at the 1958 
meeting of the American Psychological Association, 
Washington, D. C. 

2Now at R. B. Jackson Memorial Laboratory, 
Bar Harbor, Maine. 


this technique takes slightly longer, it ap- 
pears that it does not demand so much from 
the S. The questions do not seem so difficult 
to answer. 


Method 
Problem 


If shading is truly related to anxiety, as has often 
been hypothesized (Beck: 1944, 1951; Binder, 1937; 
Hertz, 1940; Klopfer, 1937; Rorschach & Oberholzer, 
1942), then it would seem reasonable to assume that 
shading responses would be more prominent in a 
psychiatric population than in a _nonpsychiatric 
population. However, if it is also assumed that psy- 
chiatric patients are under greater stress, then it 
may be reasonable to assume that it would be more 
difficult for them to identify the stimuli giving rise 
to their behavior. Although they may be reacting 
to shading, the stress they are under may make it 
more difficult for them to report it. Cox and Sara- 
son (1954) have made this point and have actually 
found fewer texture responses in high anxiety Ss 
than in low anxiety Ss. An easier, more objective in- 
quiry may enable such Ss to express the shading de- 
terminant whenever they are using it. On the basis 
of this reasoning, it was predicted that psychiatric 
patients would show a greater difference in shading 
responses between the regular and the Baughman in- 
quiries than would nonpsychiatric Ss. 


Subjects 


Data from a previous study by Baughman (1959) 
provided information from a nonpsychiatric popu- 
lation.2 Ss were 162 employees of three insurance 
companies in a southern city. The psychiatric Ss 
were 60 patients from the inpatient and outpatient 
services in a hospital in a nearby town. Psychi- 
atric Ss were selected according to three criteria 
First, they were to be showing no obvious signs of 
psychosis. Second, there should be no question of or- 
ganic impairment, and third, they should be literate. 


3 The author is indebted to Emmett Earl Baugh- 
man, who made available data from his research. 
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Fig. 1. Cumulative distributions of shading re- 
sponses obtained from psychiatric and nonpsychi- 
atric populations, using two Rorschach methodol- 
ogies. 


Whether a patient was considered psychotic was de- 
cided by the psychiatrist in charge. In some cases, 
later evaluation led to a psychotic diagnosis, but in 
such instances the diagnosis could be termed debat- 
able, and the S was retained in the study. The psy- 
chiatric and the nonpsychiatric groups were compa- 
rable in terms of mean number of responses to the 
Rorschach (20.10 and 22.05, respectively) and me- 
dian education (high school graduate). 


Procedure 


The nonpsychiatric population consisted of two 
groups of 81 Ss each, one group receiving the regular 
inquiry and the other the paired comparison. The 
psychiatric population consisted of two groups of 30 
Ss each, one group receiving the regular inquiry and 
the other the paired comparison. The same examiner 
tested all the psychiatric Ss and many of the non- 
psychiatric, so that the method of administration 
vas comparable. All records were scored by two 
scorers. Although scoring included the various forms 
of Y, V, and 7, interscorer reliability was based on 
the sum of shading responses for each record, that 
is, the total number of responses involving shading of 
any kind. Interscorer reliability (Pearson r) ranged 
from .90 to .98. 


Results 


A comparison of the groups on the basis of 
the sum of shading responses is shown in 
Fig. 1. Because of the unequal Ns for the 
two groups, the results are depicted in cumu- 
lative percentages. Table 1 lists the means 
and standard deviations of number of shad- 
ing responses for the two inquiries with the 
psychiatric and the nonpsychiatric popula- 
tions. On the basis of the sum of shading re- 
sponses, both the psychiatric and the nonpsy- 
chiatric groups show significantly more shad- 
ing on the Baughman inquiry procedure (p 
< .01). However, the psychiatric population 
shows even more than does the nonpsychiatric 
population. A ¢ test of the difference between 
the differences was significant at <.01 level, 
confirming the hypothesis that the psychi- 
atric population would show a greater differ- 
ence between the regular and the paired com- 
parison inquiries on the basis of the shading 
response. 

The relative efficiency of the two inquiry 
procedures for discriminating between psy- 
chiatric and nonpsychiatric populations was 
tested. As can be seen in Fig. 1, the regular 
inquiry shows virtually no difference between 
the two populations, and where the differ- 
ences occur, they are the result of the non- 
psychiatric population giving more shading 
than the psychiatric population. The direc- 
tion of difference supports the theory of Cox 
and Sarason but is contrary to expectations 
based on orthodox Rorschach theory. How- 
ever, the differences between the groups on 
the basis of the regular inquiry are not sig- 
nificant (¢ = .69, 109 df). On the basis of 
the Baughman paired comparison inquiry, the 


Table 1 


Shading Responses for Psychiatric and Nonpsychiatric 
Populations on Each of Two Inquiry Procedures 


Inquiry 


Psychiatric 


Nonpsychiatric 


xX 
Regular SD 

N 

xX 
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psychiatric population gives more shading, as 
would be predicted from Rorschach theory. 
A t test of the differences was significant at 
<.02. 


Discussion 


The hypothesis that psychiatric Ss would 
show a greater difference between the regular 
and the paired comparison inquiries in regard 
to the shading response was supported by the 
results. Furthermore, the differences found in- 
dicated that the paired comparison inquiry 
differentiates between psychiatric and nonpsy- 
chiatric populations significantly better than 
does the regular inquiry. These results lend 
support to the assumption that psychiatric 
patients are more sensitive to the shading de- 
terminant but have more difficulty reporting 
it. In other words, the regular inquiry may 
not differentiate between Ss who give little 
shading because they are not utilizing it and 
Ss who give little shading because they are 
having difficulty determining the basis of their 
response. The Baughman paired comparison 
inquiry procedure makes it easier for both 
psychiatric and nonpsychiatric Ss to report 
the determinants involved in their responses. 
The fact that on the regular inquiry pro- 
cedure, psychiatric and nonpsychiatric Ss give 
essentially the same amount of shading, and 
on the easier Baughman inquiry psychiatric 
Ss give significantly more shading than non- 
psychiatric Ss, lends support to the common 
contention that shading is in some way re- 
lated to anxiety and should therefore be more 
prominent in a psychiatric population. 


Summary 


Sixty psychiatric and 162 nonpsychiatric Ss 
were compared on the basis of Rorschach 
shading responses obtained with two inquiry 
procedures, the regular and the Baughman 
paired comparison. It was predicted that the 
psychiatric group would show a greater dif- 


ference between the two inquiry procedures, 

since they should be responding more to shad- 

ing but have greater difficulty reporting it, 
because of the stress they would presumably 
be under. The results supported this hypothe- 
sis at <.01 level of significance. The results 
also indicated that on the basis of total num- 
ber of shading responses, the regular inquiry 
procedure failed to differentiate between the 
psychiatric and the nonpsychiatric groups, 
whereas the Baughman paired comparison in- 
quiry differentiated at <.02 level of signifi- 
cance. 
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THE RELATIONSHIP BETWEEN KLOPFER’S RORSCHACH 
PROGNOSTIC RATING SCALE AND PHILLIPS’ CASE 
HISTORY PROGNOSTIC RATING SCALE ' 


CLAUDENE SEIDEL 


George Washington University 


Clinical use of the Rorschach test has led 
to the observation that there is often an ex- 
treme discrepancy between the general noso- 
logical picture that a patient presents overtly 
and symptomatically, and the ego structure 
as revealed by the Rorschach. Some of the 
psychological implications of this observa- 
tion, according to Klopfer, are that the Ror- 
schach record usually reflects the individual's 
ego organization as it is shaped by his pre- 
morbid level of maturity as well as his ego 
defenses and character formation, and that 
the ego organization may not always be easily 
recognizable in overt symptoms (Klopfer, 
Ainsworth, Klopfer, & Holt, 1954). 

The maturity of the male patient may also 
be reflected in his history of sexual adjust- 
ment and marital responsibilities as well as 
in his relationships to parents and peers from 
childhood to adulthood. The psychiatric pa- 
tient’s premorbid maturity may not be readily 
apparent from his observable behavior at the 
time of admission to the hospital. Scales have 
been developed to deal with the patient’s un- 
derlying ego organization and premorbid so- 
cial and sexual history. 

Phillips (1953) has developed a rating scale 
which utilizes case history data for predicting 
the outcome of shock treatment in schizo- 


1 This paper is based on a thesis submitted to the 
faculty of the Columbian College of the George 
Washington University, Washington, D. C., in par- 
tial satisfaction of the requirements for the degree 
of Master of Arts. The author is presently a clinical 
psychologist with the Arlington (Virginia) Mental 
Hygiene Clinic. Acknowledgment is made to Albert 
D. Annis, formerly Director of Research in Psychol- 
ogy, St. Elizabeths Hospital, Washington, D. C., 
for his assistance in setting up the study, his advice 
throughout the course of the investigation, and his 
ratings on the Rorschach Prognostic Rating Scale. 
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phrenia. This scale includes three areas of 
case history data: the premorbid history 
(sexual and social adjustment), possible pre- 
cipitating factors, and signs of the disorder 
(affect and thought). Phillips found that, in 
general, case history items pertaining to the 
recent premorbid past (found in Sec. I of the 
scale) showed the clearest relationship to out- 
come with his group of patients. These items 
are: recent sexual adjustment, social aspects 
of the recent sexual life, past history of per- 
sonal relations, and the recent adjustment in 
personal relations. 

It was thought that Phillips’ scale and 
Klopfer’s Rorschach Prognostic Rating Scale 
(RPRS) (Klopfer, Kirkner, Wisham, & 
Baker, 1951) might be of value in determin- 
ing whether there is a relationship between 
the level of maturity reflected in a schizo- 
phrenic patient’s case history and his Ror- 
schach record, and also might be useful in 
predicting outcome of the illness. 


Method 
Sample 


The sample was selected from the file of regular 
referrals (for diagnostic evaluation) to the Psychol- 
ogy Branch of Saint Elizabeths Hospital, Washing- 
ton, D. C. (This is a mental hospital of over 7,000 
patients made up of voluntary admissions and com- 
mitments from all socioeconomic levels.) The cri- 
teria set up for the selection of the sample were: 

1. Only white, male schizophrenic patients were 
selected because the sample on which Phillips tested 
his scale was comprised of males. 

2. The patients in the sample group should have 
been given a Rorschach test within four months 
after their admission date. (Two patients in the 
sample group did not strictly meet this criterion; 
one was tested within five months of admission and 
the other within six months.) 
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3. The patients should have been hospitalized on 
or before May 1, 1954, so that an evaluation could 
be made of the outcome of their illness after three 
years (ie. on May 1, 1957). 

4. The Rorschach records used in this study were 
obtained before these patients had received tranquil 
izing drugs 

5. Only legal residents of the District of Columbia 
and Veterans Administration cases were included so 
that comparably complete case histories could be 
obtained for all members of the group. 

6. The age range was between 20 and 50. 

7. Only patients whose Rorschach records con- 
tained at least five responses were chosen for study. 

In order to select 100 cases it was necessary to 
choose all patients conforming to these criteria that 
were referred for testing between February 1945 and 
December 1954. 

The distribution of the sample group according to 
diagnosis (type of schizophrenia) is as follows: cata- 
tonic—33, paranoid—32, hebephrenic—S, simple—2, 
schizo-affective—2, undifferentiated—22, and type 
undetermined—4. The mean age of the sample was 
31.9 years. 


Rating Procedure 


Phillips’ Scale. The case histories were rated on 
Phillips’ scale by the author. Each rating was made 
on that portion of the case history that was obtained 
at the psychiatric case study interview and from rela- 
tives and friends at the time of admission. Later 
data, pertaining to the period of hospitalization, was 
not included in the scale ratings. 

The ratings on premorbid social and sexual his- 
tory (Sec. I of Phillips’ scale) were used in this 
study rather than the ratings on the remainder of 
the scale, because in Phillips’ study, Sec. I showed 
the most significant relationship to outcome of ill- 
ness. Preliminary correlations with the Klopfer rat- 
ings were found to be as high or higher when rat- 
ings on Sec. I of Phillips scale were used as when 
the ratings on the total scale were used. The cor- 
relation between Phillips I and Phillips total was 
found to be .96 for the sample group used in this 
study. 

Each patient was rated on five subsections of 
Sec. I and the average of these ratings was deter- 
mined. There are actually six subsections in Sec. I, 
but since two of these (social aspects of recent 
sexual life—30 years and above; social aspects of 
recent sexual life—below 30 years of age) are mutu- 
ally exclusive, each patient could be rated on only 
five subsections. 

Klopfer’s Scale. The ratings on Klopfer’s Ror- 
schach scale were made without knowledge of the 
scores obtained on Phillips’ scale.* 


2 Klopfer’s method for determining scores was fol- 
lowed with one exception. Klopfer subtracts 1.0 from 
the weighted form level score when there is a dis- 
crepancy of 3.0 or more between the lowest and 
highest form level ratings, provided the lowest form 
level rating is a minus score. In this study 1.0 was 


Outcome Criterion 


One of the stated purposes of this study was to 
learn whether Phillips’ scale and Klopfer’s scale 
might be useful in predicting the outcome of illness 
of schizophrenic patients. Two criteria were set up 
to investigate this question. The first of these was 
admission to the hospital on or before May 1, 1954, 
so that an evaluation of outcome of illness could be 
made after a three-year period. Secondly, those pa- 
tients that left the hospital prior to three years had 
to receive a psychiatric evaluation of Social Re- 
covery or Recovery. It was found in checking the 
records that 7 patients did not strictly meet the first 
criterion, so they had to be excluded from the prog- 
nostic evaluation. This left 93 patients that met the 
first criterion. Thirty patients did not meet the sec- 
ond criterion because they left the hospital prior to 
three years with a psychiatric evaluation of Im- 
proved or Unimproved. These 30 patients are a 
heterogeneous group. Some of them were released 
against medical advice, others were transferred to 
another hospital, and others were discharged be- 
cause they were considered to have received maxi- 
mum hospital benefit. Since these 30 patients could 
not be analyzed properly as to outcome of illness, 
they also had to be excluded from the prognostic 
evaluation (for reference purposes, the 37 patients 
that did not meet the criteria are called the “other” 
group). This left 63 patients who met the criteria: 
31 that were hospitalized continuously for three 
years or longer, and 32 that were discharged in less 
than three years with a psychiatric evaluation of 
Social Recovery or Recovery. 


Results 


Product-moment correlations were com- 
puted for the original sample group of 100 
cases and the prognostic evaluation group of 
63 cases. Table 1 gives the correlations be- 
tween the social and sexual history ratings 
(Phillips I) and: (a) Klopfer’s total (final) 
prognostic score ratings, (6) Klopfer’s form 
level ratings, and (c) Klopfer’s determinant 
ratings. The term “determinant rating” re- 
fers to the sum of the ratings on the first five 
sections of the RPRS, i.e., everything except 
form level. 

Point biserial coefficients were computed to 
determine the correlation between the Klopfer 
ratings and the dichotomous variable of So- 
cial Recovery or Recovery in less than three 
years, as against continuous hospitalization 


subtracted only when the lowest form level rating 
was —1.5 or lower. This exception was made be- 
cause with psychotic subjects it is often difficult to 
obtain a satisfactory inquiry and therefore specifica- 
tions of responses are difficult to evaluate. 
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Table 1 


Pearson Correlations between Klopfer’s RPRS Ratings 
and Phillips’ Social and Sexual History Ratings 


Prognostic 

Evaluation Sample 
Group Group" 

(V = 63) (N = 100) 


Original 


Between RPRS total and 
Phillips I 

Between RPRS form level 
and Phillips I 


Between RPRS determinants 
and Phillips I 


* Significant at .05 level. 
** Significant at .01 level. 


for three years or more. Point biserial coeffi- 
cients were also computed to determine the 
correlation between premorbid social and 
sexual adjustment and the same dichotomous 
variable. These coefficients are presented in 
Table 2. 

Chi square values were determined to learn 
whether there is a relationship between the 
Rorschach ratings and diagnosis, and between 
the social and sexual history ratings and diag- 
nosis, and between these ratings and age. The 
Klopfer total ratings were divided into five 
categories: High (scores of +2.61 and above), 
Medium High (+1.50 to +2.60), Medium 
(+.18 to +1.49), Medium Low (+.17 to 
—2.49), and Low (—2.50 and below). These 
categories were set up on the basis of Klop- 
fer’s Groups III, IV, and V (Klopfer et al., 
1951). The combined High and Medium 
High categories correspond to Klopfer’s Group 
III (better than 50-50 chance; any treatment 
will be of some help), the combined Medium 
and Medium Low categories to Klopfer’s 
Group IV (50-50 chance), and the Low 
category to Klopfer’s Group V (a difficult 
case that may be helped somewhat but is 
generally a poor treatment prospect). Since 
there was no comparable basis for grouping 
the social and sexual history ratings, they 
were divided into only two groups: above the 
median, and at the median and below. The 
variable of age was divided into two groups: 
below 30 years of age (i.e., 20-29), and 30 
years of age and above (i.e., 30-50). This 
point of division was chosen because Phillips’ 


scale is divided in this manner for one sub- 
section of Sec. I. 

The computed chi square for the relation- 
ship between the Klopfer total categories and 
diagnosis is 16.35, which is significant at the 
.05 level. The relationship between the social 
and sexual history ratings (Phillips I) and 
diagnosis shows a chi square value of 7.44, 
which is also significant at the .05 level. The 
computed chi square for the relationship be- 
tween the Klopfer total groupings and age is 
.70, which is not sufficient to be statistically 
significant. Neither is the relationship be- 
tween the social and sexual history ratings 
and age statistically significant; here the chi 
square value is 1.00. 


Discussion 


The present study represents two different 
approaches to the study of the schizophrenic 
patient. One approach stresses the importance 
of the past history of the individual, especially 
aspects of his premorbid sexual and social ad- 
justment. The other approach is primarily 
concerned with the patient’s present and po- 
tential ego strength, or adjustment level, as 
reflected by the Rorschach. 

The significant correlation between the rat- 
ings on Sec. I of Phillips’ scale and the total 
ratings on Klopfer’s scale supports the hy- 
pothesis of a relationship between the social 
and sexual history and the ego strength of 
schizophrenic patients. These findings are in 
agreement with the related findings of Fine, 
Fulkerson, and Phillips (1955) with normal 
subjects. These investigators found that the 
lower the social adequacy of the individual, 
the higher the degree of his maladjustment. 


Table 2 
Point Biserial Correlations between Ratings and Social 
Recovery or Recovery in Less Than 3 Years/Contin- 
uous Hospitalization for at Least 3 Years 


Ratings Correlations with Outcome 


Social and Sexual history 
(Phillips I) 

Klopfer total 

Klopfer form level 

Klopfer determinants 


46** 
40** 
44** 


** Significant at .01 level. 
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In the present study, the poorer the pre- 
morbid social and sexual adjustment, the 
lower the ego strength. 

According to the results obtained in this 
study, the social and sexual history of the 
schizophrenic patient is significantly related 
to his reality ties (as reflected by RPRS form 
level ratings) but is not significantly related 
to aspects of ego strength other than reality 
testing (reflected by determinant ratings on 
the RPRS). It seems likely that form level, 
as rated on the RPRS, reflects the more di- 
rect manifestation of a psychotic breakdown 
in ego strength, whereas the determinant rat- 
ings of the RPRS reflect, to a greater extent, 
the type of ego defenses and potential emo- 
tional resources of the individual. It is likely 
that the Phillips case history data reflect a 
manifest record of poor sexual and social con- 
tacts without adequate consideration of the 
underlying psychodynamics of sexual and so- 
cial disturbance which the determinant score 
would be more likely to reflect. It is therefore 
not surprising that a significant relationship 
was found between the premorbid social and 
sexual data and form level on the RPRS, but 
not between the sexual and social history and 
the Rorschach determinants. 

The results indicate that the RPRS deter- 
minant scores as well as the RPRS form level 
ratings are significantly related to the out- 
come of the schizophrenic illness. This sug- 
gests that the underlying defenses and poten- 
tial emotional resources of the individual as 
well as the extent of his deficiencies in reality 
testing are important in influencing the out- 
come of the illness. 

The significant correlations with outcome 
of illness suggest that ratings on social and 
sexual history, Klopfer form level, Klopfer 
total, and Klopfer determinants would all be 
useful in predicting the outcome of illness of 
schizophrenic patients.* The present study 
therefore contributes toward the validation of 
the RPRS by supplying further data regard- 
ing its use with schizophrenic patients. This 

It should be pointed out that the revision of 
Klopfer’s form level method (see fn. 2) in the pres- 
ent study probably slightly increased the correlations 


in which Klopfer form level or Klopfer total ratings 
are involved. 


study also confirms the results obtained by 
Phillips. 

Both the Klopfer total ratings and the so- 
cial and sexual history ratings show a sig- 
nificant relationship to diagnosis. The dis- 
tributions suggest that paranoid schizophren- 
ics received more favorable scores on both 
measures than did other subclassifications of 
schizophrenia. 


Summary 


One hundred male schizophrenic patients 
were rated on Phillips’ Prognostic Rating 
Scale and Klopfer’s Rorschach Prognostic 
Rating Scale and 63 could be followed up to 
determine their status as either discharged, 
recovered, or still hospitalized. 

1. Section I of Phillips’ scale was signifi- 
cantly related to RPRS ego strength (total) 
and reality testing (form level). 

2. No significant relationship was found be- 
tween Sec. I of Phillips’ scale and RPRS de- 
terminants. 

3. Section I of Phillips’ scale and RPRS 
ratings for ego strength (total), form level, 
and determinants were significantly related 
to social recovery. 

4. Section I of Phillips’ scale and RPRS 
ego strength were significantly related to diag- 
nosis (type of schizophrenic reaction). 

5. No significant relationship was found 
between the age of the patient and RPRS ego 
strength or between age and Sec. I of Phillips’ 
scale. 

6. Implications for predicting the outcome 
of illness of schizophrenic patients were dis- 
cussed. 
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THE EFFECTS OF CHEMOTHERAPY ON LENGTH OF 
STAY AND RATE OF RETURN FOR PSYCHI- 
ATRICALLY HOSPITALIZED PATIENTS’ 


ROBERT B. ELLSWORTH 
Veterans Administration Hospital, Ft. Meade, South Dakota 
anp WILLIAM H. CLAYTON 
Veterans Administration Hospital, Salt Lake City, Utah 


From the results of many well controlled 
investigations of the effects of ataractic. drugs 
on psychiatrically hospitalized patients, there 
can be little doubt but that many of these 
drugs act as a stimulus in modifying the 
psychotic’s behavior and symptomatology. As 
Staudt and Zubin (1957) point out, however, 
the important question is not so much whether 
the therapeutic agent stimulates an immedi- 
ate change, but whether the agent has any 
significant effect on longitudinal recovery 
rates. The use of control and experimental 
groups in an adequate double blind design, 
in which the experimental (drug) group was 
found to have improved significantly in com- 
parison with the control group, does not be- 
gin to answer the important question raised 
by Staudt and Zubin. In such studies, the use 
of the control group allows the researcher to 
reach conclusions regarding only the immedi- 
ate stimulus effect of the drug under investi- 
gation. 

Perhaps the best way to evaluate the over- 
all contribution of these drugs would be to 
compare the release rates, length of hospitali- 
zation, and return rates of two identical hos- 
pitals; one committed to a therapeutic in- 
volvement with the psychotic patient without 
drugs, the other relying on drugs as the ma- 
jor treatment method. Another possibility 
would be to compare the same hospitals’ dis- 
charge and return rates before and after the 
introduction and extensive use of drugs as a 


1The data for this study were gathered at the 
VA Hospital, Salt Lake City, Utah, under the sup- 
port of the Psychiatric Evaluation Project, Richard 
L. Jenkins, Director. 


treatment method. It is the purpose of this 
paper to present such comparative data in an 
effort to evaluate the effect of ataractic drugs 
on the over-all rehabilitation of the psychotic 
patient. 


Procedure 


The hospital under consideration is a small, well 
staffed Hahn-type Veterans Administration Hospital. 
The Hahn-type hospital was designed so that each 
ward operated as a small treatment unit (24 to 32 
beds), each with its own staff, dining facilities, and 
day rooms (Anderson & Fechner, 1956). Until very 
recently, each treatment ward in this hospital was 
staffed by a treatment team consisting of a psychia- 
trist, psychologist, social worker, and nurse. Only 
rarely did a treatment team have responsibility for 
more than two treatment units. The aide-patient 
ratio ranged from 1:2 for closed wards to 1:8 for 
some open wards (all 3 shifts). By the minimum 
suggested standards for NP hospital staffing ratios 
(American Psychiatric Association), this hospital was 
considered to be very adequately staffed. 

The two periods used to evaluate the effectiveness 
of the widespread use of ataractic drugs in this well 
staffed hospital were: July 1, 1953 to July 11, 1954 
(1 to 2 years prior to the introduction of ataractic 
drugs to the hospital); and July 1, 1956 to April 
29, 1957. A recent survey revealed that 89% of all 
psychotic patients were on ataractic medication con- 
tinuously from the time of their admission up to 
(and often beyond) the time of their discharge, and 
that 96% of all psychotics received ataractic medi- 
cation sometime during the course of their hospitali- 
zation. Thus, the major change in this hospital be- 
tween 1953-54 and 1956-57 was the introduction 
and widespread use of ataractic drugs in the treat- 
ment of the psychotic patient. 

The patients under consideration were the 200 con- 
secutive psychotic admissions between July 1, 1953 
and July 11, 1954; and the 200 consecutive psychotic 
admissions between July 1, 1956 and April 29, 1957. 
All patients were male, nongeriatric veterans who 
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Table 1 


Length of Hospitalization and Return Rates of 1953-54 (predrug) 


1953-54 
Predrug 

Median length of 
hospitalization of 116 days 


all patients (N = 200) 


Return rate to 
hospital within 1 
year of Hischarge 25.3% 
of all patients 

(N = 150) 


Median length of 
hospitalization of 
134 1953-54 patients 


and 134 1956-57 112 days 


patients matched on 
marital status and 
residence 


* Median test, chi square analysis. 


were diagnosed as functionally psychotic at the time 
of admission. The admission diagnosis of functional 
psychosis was used since an admission diagnosis of 
psychotic was 86% accurate as compared with the 
discharge diagnosis. Actually, the diagnosis of a re- 
covered patient, who was clearly psychotic at the 
time of admission, was often changed to one of non- 
psychosis upon discharge. The admission diagnosis 
thus appeared to be adequate for the purposes of the 
study. Because of the unreliability of the subdiag- 
nostic categories (Wehlman, 1952), the diagnoses of 
psychoses were not further broken down, although 
there appeared to be about the same proportion of 
affective psychotic states in both of the predrug and 
drug periods 

The conclusions regarding the over-all effectiveness 
of the ataractic drugs on the rehabilitation of the 
psychotic patient are based on both a comparison of 
the median length of hospitalization and the return 
rates of predrug patients and drug patients. The 
median length of hospitalization demonstrates the 
efficiency of the hospital system, but not necessarily 
the effectiveness of treatment. The return rates within 
a year following discharge of both the predrug and 
drug groups are used, therefore, to evaluate the effec- 
tiveness of ataractic drugs on the over-all rehabilita- 
tion of the psychotic patient. Since waiting for all 
patients to be discharged and followed for one year 
after discharge would have involved a delay of sev- 


eral years, the follow-up data were based on the: 


75% of patients who left the hospital within 292 
days of admission in the predrug group, and the 
75% of drug group patients who had left the hos- 
pital by 252 days from the time of admission. This 
40-day difference between the time it took 75% of 


Psychotic Admissions and 1956-57 (drug) Admissions 


1956-57 Significance of 
Drug Difference* 
110 days N.S. 


31.3% N.S. 


92 days N.S. 


the predrug and 75% of the drug group to be dis- 
charged represented a difference of only four pa- 
tients. In other words, on the 252nd day from the 
date of admission, 150 1956-57 patients had been 
discharged, while 146 1953-54 patients had been dis- 
charged. Thus, the rates of return to this hospital 
are based on a total of 150 1953-54 patients and 150 
1956-57 patients. The return rates to this hospital 
were found to underestimate the return rates of psy- 
chotic patients to this and other hospitals within 
one year of discharge by 7%.2 A careful personal 
contact follow-up of almost 100 psychotic patients 
revealed that of the patients who were rehospitalized 
within one year after discharge, the vast majority 
returned to the discharging hospital. 

The length of hospitalization and return rates to 
the hospital may be related to population character- 
istics and not necessarily to the type of treatment re- 
ceived. On all 400 patients in this study, data with 
regard to marital status, residence, service connec- 
tion, type of admission, etc. were analyzed in rela- 
tion to length of hospital stay and return to the 
hospital within one year of discharge. Married pa- 
tients were found to leave the hospital significantly 
sooner than single patients. Out of state residents 
remained significantly longer than residents. An ex- 
amination of the 1953-54 and 1956-57 psychotic pa- 
tient groups revealed that 47% of the 1953-54 ad- 
missions were married and 32% of the 1956-57 pa- 
tients were married. On the other hand, 2% more of 
the 1953-54 group were out of state residents. Con- 
sideration of both these factors indicated that the 


2 This follow-up is part of the research procedure 
of the VA Psychiatric Evaluation Project. 
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1953-54 group was likely to remain in the hospital 
a shorter period of time, based on the difference in 
the population characteristics alone. In the later 
presentation of the median length of hospitalization 
figures, 134 1953-54 patients are matched on marital 
status and residence with 134 1956-57 patients. This 
matching allows for a more accurate estimate of the 
effect of the introduction of ataractic drugs on the 
length of hospitalization. None of those variables 
were related to the return rates. The married patient, 
for instance, was just as likely to return to the hos- 
pital within a year of discharge as the single patient. 


Results 


From data presented in Table 1, there were 
no statistically significant differences in the 
median length of hospitalization of 200 pre- 
drug and 200 drug psychotic admissions, nor 
in the return rates of the 150 1953-54 (pre- 
drug) and 150 1956-57 (drug) discharged 
patients. At any one given time an average 
of 3% (total group) to 5% (matched group) 
more drug patients had been discharged from 
the hospital compared with the number of 
predrug patients. For example, within 100 
days of admission, 86 1953-54 patients had 
been discharged, while 92 1956-57 patients 
had been discharged from the hospital. On 
the average of 6 to 7 days later, however, an 
equal number of 1953-54 patients had been 
discharged from the hospital. 

The direction of the differences in Table 1 
suggests that the ataractic drugs resulted in 
a more efficient hospital operation but a less 
effective treatment program.® Since these dif- 
ferences were not significant, however, it was 
necessary to conclude that the use of ataractic 
drugs in the treatment of psychotic patients 
in a well staffed hospital has no demonstrable 
effect on the rehabilitation of the patient. 


Discussion 


The tangible within-hospital results of the 
use of ataractic drugs, however, are apparent. 
The wards are quieter, ECT has been re- 
duced from 54 patients treated in 1953-54 to 
16 patients treated in 1956-57. These changes 
have made it a more comfortable place for 


8 Many'of the 1956-57 patients were discharged 
and continued taking medication after leaving the 
hospital (based on trial visit reports). Even this ad- 
ditional factor of posthospital medication apparently 
made no difference in the over-all return rate of this 


group. 
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the staff to work, and perhaps a more desir- 
able place for petients to come. The fact re- 
mains, however, that the tendency of this 
well staffed hospital to rely so heavily on the 
ataractics has resulted in no measurable in- 
crease in over-all patient rehabilitation. On 
rounds and in staff conferences, the patient 
who is not improving is usually treated by a 
change in the dosage of his medication or by 
a change from one drug to another. 

One possible explanation of the suggested 
higher return rate for patients treated with 
drugs is that the hospital was admitting more 
chronic patients in 1954-57, and, therefore, 
could expect a higher return rate following 
discharge. In a recent study on the rehabilita- 
tion of “hard core” chronic patients who had 
been hospitalized elsewhere at least five years 
prior to transfer (Ellsworth, Mead, & Clay- 
ton, 1958), it was reported that over 50% of 
these patients, with extremely poor prognoses, 
had been discharged from the hospital. The 
return rate, to amy hospital within one year 
after discharge, was 31%. Since the return 
rate to this hospital reported for admission 
patients treated in the drug era (1956-57) 
was 31.3%, it would appear that chronic pa- 
tients have at least as good a chance of re- 
maining out of the hospital for one year as 
the more acute admission patients. It must be 
remembered that the 31.3% return rate of the 
1956-57 psychotic to this hospital underesti- 
mated the actual return rate to this and other 
hospitals by 7%. In light of this evidence, 
the conclusion that there is no essential dif- 
ference in patient rehabilitation as a result of 
the introduction of ataractic drugs is main- 
tained. 

Why are these drugs used and relied on so 
heavily in a well staffed psychiatric hospital? 
From the data of the present study, it is ap- 
parent that patients improved before the ad- 
vent of ataractic drugs at about the same 
rate as patients treated with these drugs. The 
heavy utilization of these drugs reflects the 
staffs’ conviction of their effectiveness, which 
can perhaps best be attributed to the error 
of post hoc ergo propter hoc logic. In other 
words, the staff assumes that since many pa- 
tients improved following the administration 
of an ataractic drug, the drug must be the 
causative agent of that improvement. Actu- 
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ally, of course, the administration of the drug 
was only one of several changes in the pa- 
tient’s life following his admission to the 
hospital. 


Summary 


1. The length of psychiatric hospitalization 
and return rates of two large groups of psy- 
chotic, male, nongeriatric patients admitted 
to a well staffed hospital were compared. One 
group (1953-54) was treated before the in- 
troduction of ataractic drugs. The other group, 
over 90° of whom were given ataractic drugs, 
was treated during the period 1956-57. 

2. Aside from such in-hospital effects as 
quieter wards and less ECT, there were no 
statistically significant differences in the me- 
dian lengths of hospitalization and the return 
rates of the two groups. 


3. A careful examination of the differences 
in population characteristics of patients ad- 
mitted during both years failed to account for 
the similarity of release and return rates dur- 
ing the drug and nondrug era. 
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The purpose of this study is to predict the 
performance of brain damaged individuals on 
the Rorschach test by utilizing a recently 
developed theoretical approach to Rorschach 
analysis. This approach is based on the ap- 
plication of the Haskell Coaction Compass 
(1949) to personality dynamics and has 
proved fruitful in a number of studies (Clarke 
& Lodge, 1955; Lodge, 1953; Lodge & Gib- 
son, 1952; Lodge & Guyer, 1953; Lodge & 
Steenbarger, 1953). The application of Has- 
kell’s Coaction method to psychology is based 
on the observation by Lodge and Steenbarger * 
that: 


Personality theory, historically as well as recently, 
has inclined to rest largely upon assumptions of dy- 
namic interplay between two sets of intrapsychic 
power systems. ... Such standard dichotomies as 
thought and feeling, reason and instinct, ego and id, 
inhibition and facilitation, socialized and primitive 
reactions, “consciousness” and “unconsciousness,” etc., 
all suggest interdependent power systems of roughly 
comparable meaning. 


Interpretation of the Rorschach test, derived 
as it was from Rorschach’s views on Jung’s 
psychoanalytic formulations, has also tradi- 
tionally relied on the assumption of two such 
intrapsychic power systems. Lodge and Steen- 


1 An earlier version of this paper was presented at 
the Southeastern Psychological Association Meeting, 
Atlanta, Georgia, April 29, 1958. This research was 
supported, in part, by grant B-1459 from the Na- 
tional Institute of Neurological Diseases and Blind- 
ness, United States Public Health Service. The au- 
thors gratefully acknowledge the helpful suggestions 
of G. T. Lodge. 

2 From “Relations of the Personalities of Scientists, 
Normal Persons, Neurotics, and Psychotics: A Geo- 
metric Analysis,” by G. T. Lodge and C. J. Steen- 
barger to be published in Volume II of E. F. Has- 
kell (Ed.), An Introduction to Unified Science. 


barger (1953) have identified these variables 
as Control and Affect and provided opera- 
tional definitions for them in terms of Ror- 
schach responses. 

The determinants of any given Rorschach 
response are considered to reflect the inter- 
action of these two variables, which are de- 
fined so that form level represents the strength 
of the control component, while color, shad- 
ing, vista, and movement determinants are 
considered expressive of affect.* Three levels 
of each are specified: weak (—), neutral (0), 
and strong (+), thus yielding nine possible 
qualitatively different relationships or types 
of coaction. 

Relationships between these variables can 
be visually represented conveniently on a 
cartesian coordinate system called the coac- 
tion compass (Fig. 1), whose X and Y axes 
represent, respectively, Control and Affect. 
Each response becomes a resultant vector of 
unit length, with its direction determined by 
the relative strength of the two coacting proc- 
esses. All response vectors in a given protocol 
may be geometrically combined to yield a 
single overall vector representing the deter- 
minant pattern of the entire record. This vec- 
tor may fall in any direction on the compass, 
due to the infinite number of possible inter- 
gradations between the nine coaction cate- 
gories, and its length may range between 0 
and 100% of the radius (the longer the vec- 
tor, the more consistent is a given response 
tendency and vice versa). 

3 The assumption of the orthogonality or independ- 
ence of the two basic variables of control and affect 
has been supported by the finding of substantially 
zero correlations between them in a representative 
sample of protocols sele'' d from previous coaction 
studies. 
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Vectors falling in the right half of the com- 
pass reflect good form level, those in the left 
half, poor form level. Vectors falling in the 
upper half of the compass reflect the presence 
of affect, those in the lower half result from 
the absence of affect determinants. Axes of 
Integration and Rigidity are defined by bi- 
secting the right angles formed by the control 
(X) and affect (Y) axes. Positive integration 
(+ +) thus refers to strong control combined 
with strong affect, while weak control together 
with weak affect (— —) defines its negative, 
ie., lack of integration. Similarly, Rigidity 
(+ —) refers to the ratio reflecting strong 
control with weak affect, while its opposite, 
Fluidity (— +), refers to weak control with 
strong affect. 

Previous studies have demonstrated the ef- 
fectiveness of coaction technique in differen- 
tiating the records of normal, superior (sci- 
entists), neurotic and psychotic individuals 
(Fig. 1) (Clarke & Lodge, 1955; Lodge, 
1953; Lodge & Gibson, 1953; Lodge & Guyer, 
1953; Lodge & Steenbarger, 1953). Briefly, 
some of the conclusions of these studies are 
(Clarke & Lodge, 1955; Lodge & Steenbarger, 
in press): (a) Psychologically healthy indi- 
viduals, by their greater strength in both the 
control and affect components, can be differen- 
tiated from the psychologically ill. (b) Weak- 
ness in the affect component clearly differenti- 
ates neurotics as a group from both normal 
and superior individuals. (c) Weakness in 
the control component definitely distinguishes 
hebephrenics and catatonics from both nor- 
mals and neurotics. Most psychotics also show 
a deficit in level of affect. (d) There is a con- 
tinuum of psychological health corresponding 
to the integration axis and treatment of Ror- 
schach determinant scores according to co- 
action principles this effectively differentiates 
individuals in terms of the continuum. 

The present study was initiated to further 
explore the value of this tool in personality 
research by testing its predictive value with 
respect to the performance of another group, 
ie., brain damaged. In light of the previously 
cited results of earlier coaction research, 
known psychological effects of brain damage 
(Goldstein, 1942) and other Rorschach stud- 
ies of the brain damaged (Fisher & Gonda, 
1955; Klopfer & Kelley, 1946; Pena, 1953; 
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Fic. 1. Mean vector tip positions for brain dam- 
aged and control Ss, as well as original and cross- 
validation population samples of normals, neurotics, 
and psychotics (Clarke & Lodge, 1955). 


Piotrowski: 1937, 1940; Reitan, 1955; 
Schreiber & White, 1954), two interrelated 
hypotheses were developed. 

1. Because cortical damage often results in 
inteliectual and perceptual impairment, a 
relative deficit in form level is predicted, with 
a consequent low score on the control or X 
axis for a brain damaged (BD) group. 

This postulated deficit in form level or, at 
a behavioral level, control, is further assumed 
to result in a disturbed balance between forces 
of control and affect in the personality of the 
BD individual. Behaviorally, this disturbance 
might be expected to take the form of rela- 
tively consistent overcontrol, extreme lability, 
or fluctuations between rigid control and un- 
controlled emotional outbursts. In the analo- 
gous Rorschach situation, such problems of 
behavioral integration would result in unsuc- 
cessful combination of control and affect de- 
terminants, i.e., minus form level when form 
is used in combination with other determi- 
nants (e.g., CF—, FC—), and the use of only 
one kind of determinant, such as F alone or 
pure C. These considerations lead to the sec- 
ond hypothesis. 
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TABLE 1 


Subjects N 
Controls 20 
Brain damaged 20 


2. BD individuals will have a shorter pro- 
jection on the axis of integration (X’). 

There would seem to be little psychological 
basis for a differential prediction concerning 
the expression of affect determinants in a BD 
group other than to anticipate considerable 
variability reflecting the reaction of different 
personalities to various kinds of damage. 


METHOD 


The basis of selection for both the BD and Con- 
trol groups was referral to the psychologist ,for aid 
in differential diagnosis with the question of ‘cortical 
damage as one possibility. As Piotrowski (1940) has 
pointed out, this type of control group would appear 
to be the most relevant. From a pool of 80 patients 
thus referred, 20 patients with unequivocal brain 
damage were matched for age, sex, color, and edu- 
cation with 20 patients whose ultimate medical diag- 
nosis ruled out brain damage (Table 1). The criteria 
for inclusion in the BD group was either: (a) op- 
erational removal of a tumor or lesion, (b) the pres- 
ence of one unequivocal neurological sign (such as 
known history of CVA, brain tumor, etc.), or (c) 
two or more neurologically equivocal signs where 
final medical diagnosis favored an organic cause. In 
accordance with the intended focus on possible gen- 
eral factors affecting BD behavior, there was a wide 
range of brain syndromes represented. Five of the 20 
BD patients were diagnosed as having psychomotor 
and/or grand mal seizures alone. The remaining 15 
manifested symptoms of acute or chronic brain 
syndrome following a variety of CNS disease and 
trauma, including, primarily, tumor removal, CVA, 
and cortical atrophy. 

The criteria for the Control group consisted essen- 
tially of the absence of the three above conditions, 
although the presence of equivocal neurological signs 
was acceptable if final medical diagnosis was negative 
with respect to an organic cause. Also excluded were 
patients with diseases which are frequently associ- 
ated with some ultimate cortical damage, such as 
hypertension, diabetes mellitus, or a significant de- 
gree of cardiac, pulmonary, or blood disease. The 
members of the Control group, thus determined, con- 
sisted largely of neurotic patients with considerable 
depression, anxiety, and psychosomatic or hysterical 
symptomatology. Five of the Control group patients 
were diagnosed schizophrenic. All information rele- 
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vant to the selection of subjects (Ss) was obtained 
from their medical charts rather than their psycho- 
logical test records. 

As seen in Table 1, the age range was 15 to 55 for 
both groups. The mean age of the BD group was 
31.6 as compared with 34.0 for the Control group, 
a nonsignificant difference which, in any event, would 
operate against a prediction of greater deficit in the 
BD group. The difference between the mean levels of 
education, 9.9 for the BD group and slightly higher 
(10.9) for the Control group, was also insignificant. 

All of these Ss had scorable Rorschach records in 
their psychological file. No restrictions were placed 
on the number of responses or rejections since this 
was considered to reflect realistically the clinical 
situation with respect to the types of cases under in- 
vestigation. Although previous coaction studies have 
utilized Beck’s Phillips Revised Develop- 
mental Scoring System (Phillips, Kaden, & Waldman, 
1957) was used here. Phillips’ system is considered 
particularly suited for the analysis of organic rec- 
ords because it gives credit for the positive elements 
in a percept whose over-all organization is inade- 
quate. In adapting Phillips’ definitions of adequacy 
and specificity of form to coaction levels of +, 0, 
and —, the only significant departure from previous 
studies was to equate responses scored “vague” in 
Phillips’ system with O form level on the coaction 
compass. Since some previous studies have empha- 
sized the prevalence of vague W responses in proto- 
cols of organics, it seemed this might help differenti- 
ate the records (Pefia, 1953; Schreiber & White, 
1954). It should also be noted that Phillips’ system 
tends to give credit for a greater number of determi- 
nants as a result of the conditions under which a re- 
sponse is broken down into separately scored per- 
cepts. In scoring + and — form level, Phillips’ re- 
vised rules were supplemented by the tables in Phillips 
and Smith (1953). 

The records were rescored in the Phillips system 
by the senior author, using the original scoring by 
the examiners of these patients as a guide for deter- 
minants and form level ratings. The coaction com- 
pass vector for each record was then computed ac- 
cording to the method outlined by Lodge and Steen- 
barger (1952). To guard against possible bias due to 
knowledge of the composition of the groups as well 
as to establish the reliability of the scoring system, 
the 40 records of both the BD and Control groups 
were again scored by a research technician familiar 
with the Phillips system but unfamiliar with the in- 


scoring, 


> 
56 
AGE, SEX, AND EDUCATIONAL. CHARACTERISTICS OF CONTROL AND BD Susyects <i 2 
Age Sex 
Rs Range M F > 
34.0 15-55 10 10 10.9 3-18 
31.6 15-54 10 10 = 5-14 a 
| 
iy 
var 
. 
4 


Rorschach and Brain Damaged Individuals 


tent of the study and the nature of the population.* 
The reliability scoring of the variables which affect 
coaction analysis revealed 92% agreement per proto- 
col regarding the definition of responses, 99% agree- 
ment concerning the presence of determinants other 
than form, and 88% agreement on the scoring of 
form level. This degree of agreement was essentially 
the same for both groups and indicates satisfactory 
reliability of scoring 

As a further estimate of the representativeness of 
the Ss, the records were rated for Piotrowski’s signs 
(1937). The BD group averaged 3.9 Piotrowski signs 
per record, with 9 records scoring beyond the recom- 
mended cutoff for diagnosing organicity of five or 


*The authors gratefully acknowledge the assist- 
ance of Shirley Crooke, who carried out. the reli- 
ability study 


more signs. The Control group averaged 1.95 signs 
per record, with none showing more than four signs. 
This detection of 9 out of 20 “true” positives and 
no “false” positives is similar to Fisher and Gonda’s 
findings (1955). 


RESULTS 


Figure 1 illustrates the resultant mean vec- 
tor positions with respect to the X and Y axes 
for both the BD and Control groups. For the 
Control group, the mean projection on the X 
axis was 35.84 (percentage of radius). This 
is similar to projections previously reported 
for neurotic groups of 32.59 and 48.56 (see 
Table 2). However, this group showed consid- 


TABLE 2 


ScumMMARY OF THE MEANS, SIGMAS, AND SIGMAS OF THE MEANS OF PROJECTIONS UPON THE AXES OF CONTROL 
AND AFFECT, INTEGRATION AND RiGIpITy, FoR BRAIn DAMAGED AND CONTROL GROUPS OF THE 
PRESENT STUDY AS WELL AS ORIGINAL AND Cross-VALIDATION SAMPLES* 


(The units used in these tables are percentages of the radius of the coaction compass.) 


Control 


8.05 
21.28 
4.88 


Mean 
Sigma Dis. 
Sigma Mn. 


Brain damaged 
(N = 20) 


35.84 
4.12 
94 


Mean 
Sigma Dis. 
Sigma Mn. 


Control group 
(N = 20) 


Original data 


Mean 
Sigma Dis. 
Sigma Mn. 


Psychotics 
(N = 25) 


Mean 
Sigma Dis. 
Sigma Mn. 


Neurotics 
(N = 27) 


Mean 
Sigma Dis. 
Sigma Mn. 


Scientists 
(N = 19) 


‘ross-validation data 

Mean 
Sigma Dis. 
Sigma Mn. 


Psychotics 
(N = 30) 


Mean 
Sigma Dis. 
Sigma Mn. 


Neurotics 


(N = 30) 


Mean 
Sigma Dis. 
Sigma Mn. 


Normals 
(N = 30) 


® Clarke and Lodge, 1955. 


Affect Integration Rigidity 
— 10.00 
31.43 
7.21 


— 6.14 
36.14 
829 


1.54 
27.94 
641 


6.65 
20.28 
4.65 


29 14 
18.92 
4.34 


—21 51 
15.89 
3.64 


— 16.10 
24.27 
4.95 
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erably less variation than the previous groups 
with a standard deviation of only 4.12. This 
suggests that as regards form level and its 
suggested analogue of control, these Control 
Ss (who were chosen on the basis of a cer- 
tain behavioral similarity, namely, organiclike 
symptoms) are a relatively homogeneous 
population in terms of this personality factor. 

The mean projection on the X axis for the 
BD group was 8.05 with a sigma of 21.28. 
Twelve BD Ss scored below the lowest Con- 
trol S’s score of 12.5, and only one BD S$ 
scored above the Control group mean. The 
Mann-Whitney U test was utilized to test the 
hypothesis that the two groups represented 
different populations with respect to this di- 
mension. The results supported the initial pre- 
diction of a lower score on the Control axis 
for the BD group with p < .001. In compari- 
son with results of earlier studies, the BD are 
most similar to psychotics (functional) on 
this dimension (Table 2). 

The second prediction, that the BD group 
would show a shorter projection on the Inte- 
gration axis, was also supported with a p < 
.001 according to the results of the Mann- 
Whitney U test. Table 2 indicates a mean of 
1.54 for the BD and 29.14 for the Controls in 
this axis. The distribution was such that only 
five Control Ss scored below the median pro- 
jection of 19.5 on the axis of integration, while 
only five BD Ss scored above this median. 

With regard to a breakdown of the response 
types that contributed to these two results, it 
is interesting to note that there were no sig- 
nificant differences between the groups in the 
number of vague responses. The prevalence 
of minus form level in the BD records, 
both in responses determined by form alone 
and combined with other determinants, con- 
tributed most to the present findings. It 
should be noted, however, that the final pro- 
jection on the Control axis represents a dif- 
ferent kind of information than that given by 
the F+% conventionally used in Rorschach 
scoring, since every response is weighted either 
+, —, or 0, for both form and affect. Of the 
BD records, 70% contained at least one W 
minus response, whereas this sign was present 
in only 20% of Control group records, a dif- 
ference significant beyond the .01 level. Inci- 
dentally, the total number of responses given 
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by both BD and Control Ss was unexpectedly 
similar, with both groups averaging 14.0 re- 
sponses per protocol. 

With respect to affect determinants, the 
only difference between the groups was the 
anticipated greater variability among the BD 
scores, which was significant at the .02 level 
of confidence according to an F test applied 
to the difference between the variances. Previ- 
ously tested psychotic groups also showed 
wide variability on the affect dimension. The 
mean projection on the Y axis was +6.65 for 
the Control group and —6.14 for the BD. The 
tendency of the Phillips system to recognize 
a greater number of determinants may con- 
tribute to the higher score of this Control 
group in comparison with previous norms for 
neurotic groups (Table 2). 


DISCUSSION 


The finding of significantly lower scores on 
the Control or X axis for the BD group con- 
firms the first hypothesis and suggests that 
cortical damage is associated with relativel) 
inadequate form perception. This kind of im- 
pairment may be interpreted as symptomatic 
of a more general deficit in ego strength 
(Phillips & Smith, 1953) or in coaction terms, 
control, and related to inadequacies in re- 
ality testing. 

The second hypothesis concerned the Inte- 
gration of what has been defined as control 
with factors considered to reflect emotional 
responsivity. Its confirmation by a signifi- 
cantly shorter projection on the Integration 
axis for the BD group is largely due to the 
poor form level. This leads to the generaliza- 
tion that weakened control at a behavioral 
level is the primary factor leading to a dis- 
turbed balance of control and affect compo- 
nents of personality and their resultant inade- 
quate integration. The wide range of scores 
along the affect dimension for the BD group 
suggests that this leads to a variety of emo- 
tional adaptations. On the whole, the data 
obtained indicate that the BD group is more 
similar to psychotic groups than to either 
normal or neurotic populations. This similar- 
ity is interesting in light of the fact that pa- 
tients with functional psychoses frequently 
manifest organiclike symptoms. 

In view of the fact that coaction studies 
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thus far have focused largely on comparing 
the Rorschach performance of different diag- 
nostic groups, further implications of these 
findings would require research correlated 
with more specific behavioral assessment. For 
example, a promising area for future research 
suggested by the results of the present study 
might involve investigation of the correspond- 
ence between scores on the integration—dis- 
integration and fluidity—rigidity continuua 
with actual behavioral manifestations of in- 
adequate integration of control and affect in 
brain damaged individuals. 

At present, coaction methodology represents 
primarily a systematization of Rorschach de- 
terminant scoring. This leads to the general 
question of why, in the absence of clearly 
specified behavioral correlates related to con- 
trol and affect, have coaction studies con- 
sistently yielded results which significantly 
differentiate among healthy groups as well as 
among various categories of psychopathology. 
One hypothesis is that coaction analysis rep- 
resents a geometric encoding of certain im- 
plicit inferential processes involved in the 
diagnostic behavior of clinicians. Lodge and 
Steenbarger (see fn. 2) have implied this in 
their suggestion of the fundamental equiva- 
lence of major personality theory concepts 
which tend to rest on the basic assumption 
of a two-variable coacting intrapsychic power 
system. It may be that Kraepelinian and psy- 
choanalytic models, despite differences in ter- 
minology, tend implicitly to utilize a common 
logic to analyze, differentially weight, and 
finally integrate information into diagnostic 
statements. Disregarding, for the moment, the 
problem of the validity of commonly used 
categories (for example, the distinction be- 
tween neurotic and psychotic), there is the 
possibility that the formal geometric system 
imposed upon Rorschach data by coaction 
scoring represents a significant approximation 
to the principal common elements in the logic 
of diagnosis. 

If there is, then, some theoretical justifica- 
tion for exploring possible interactions be- 
tween two such variables, for which control 
and affect are convenient approximations, the 
general framework of a cartesian coordinate 
system permits representation of the possible 
interactions between them according to a con- 


sistent logical system. As a geometric model, 
the two-dimensional coordinate system has the 
advantage over traditional verbal conceptual 
frameworks of avoiding the possible imposi 
tion of unidimensionality upon the interac- 
tions of a two-variable system. It also con- 
veniently permits visual comprehension of 
these numerous relationships, simultaneously. 
At the same time, of course, it involves a net- 
work of geometric conventions. These, how- 
ever, lead to explicit hypotheses which are 
subject to test. The relation of this geometry 
to the problem of diagnosis, its language sys- 
tem and empirical referents, then, are areas 
of research suggested by the results of this 
and other coaction studies. In view of the 
present, lack of unification in the language 
system of psychopathology, especially in its 
relationship to concepts of mental health, it 
would seem crucial to clarify whatever under- 
lying logical framework may exist. 


SUMMARY 


Coaction analysis of Rorschach divides de- 
terminants into two variables: Control (form 
level) and Affect (color, shading, and move- 
ment). Interactions between these determi- 
nants are geometrically plotted to yield a re- 
sultant vector on a cartesian coordinate sys- 
tem. On the basis of symptoms described in 


‘the brain damaged, it is predicted that they 


will manifest a breakdown of form level in 
combination with other determinants to re- 
sult in (a) a lower score on the Control axis 
and (6) a shorter projection on the Integra- 
tion axis. 

From a pool of 80 patients referred for psy- 
chological testing with questionable organic 
brain disease, 20 patients with unequivocal 
brain damage were matched on age, sex, 
color, and education with 20 patients whose 
ultimate medical diagnosis ruled out brain 
damage. The Rorschach records of these pa- 
tients were scored according to the Phillips 
Revised Rorschach Developmental Scoring 
System and then plotted on the coaction 
compass. 

The results confirm the predictions and in- 
dicate that the BD population is similar to 
previously tested psychotic groups both in 
terms of weakened control, poor integration 
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of control and affect, and considerable vari- 
ability along the affect dimension. 
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Statistical controls are thought to provide a 
means of convergence between the ambiguous 
and the known. In the analysis of data drawn 
from Q sorts it has been found that stereo- 
types such as sick-well (Kogan, Quinn, Ax, & 
Ripley, 1956) and social desirability—undesir- 
ability (Edwards, 1957) can be identified. 
The suggestion is then made that the effects 
of these stereotypes be controlled either by 
item selection or by correlational techniques. 
These recommendations rest upon work aimed 
at identifying stereotypes that parallel a 
good-—bad dimension. Much has been made of 
the ease with which personality descriptive 
items can be ordered on such dimensions. The 
actual validity of such descriptions has not 
been examined extensively. 

The Q sort technique produces an amal- 
gamation of sorter, subject, and Q deck char- 
acteristics which are difficult to refract. The 
group or case descriptions resulting from these 
multiple influences are usually considered pri- 
mary data; that is, they are rarely checked 
against other descriptions of the same cases. 

This report includes an examination of Q 
deck stereotype descriptions as they relate to 
(a) known characteristics of samples, (5) 
each other, (c) item variance, and (d) reli- 
able group differences as reflected by the 
items in the deck. 


PROCEDURE 


The cases. Four groups were examined. Each had 
approximately 70 cases who were selected because of 
their Minnesota Multiphasic Personality Inventory 
(MMPI) patterns and their delinquency status. Two 


1This study was supported by research grants 
from the graduate school of the University of Min- 
nesota and the National Institute of Mental Health, 
National Institutes of Health, United States Public 
Health Service. 


samples had MMPI codes that contained a peak on 
D, or Mf, or Si, the scales which other research has 
shown to be associated with a low rate of delin- 
quency (Hathaway & Monachesi, 1953; Wirt & 
Briggs, 1959). One of these groups had no record 
of delinquency, and one was composed of delin- 
quents. Another pair of delinquent and nondelin- 
quent samples had MMPI high point codes con- 
taining a two-scale combination of Pd, or Sc, or Ma. 
These are scale combinations which have been shown 
to be associated with delinquency proneness (Hath- 
away & Monachesi, 1953; Wirt & Briggs, 1959). The 
two obvious characteristics of the groups were the 
delinquency versus nondelinquency dichotomy, and 
the personality test differences, which suggest a 
neurotic versus character disorder dichotomy. The 
groups also differed greatly in achievement (school 
and job success) as well as other forms of construc- 
tive community activity. The neurotic nondelinquent 
sample was the most successful group on these cri 
teria of success. The two delinquent samples were in- 
distinguishable in achievement and clearly below 
both nondelinquent samples 

The rating of each sample as well as the collec- 
tion of additional information has been reported 
elsewhere (Wirt & Briggs, 1959). Cases from each 
sample were seen for intensive interviews. On the 
basis of these, the interviewers sorted a Q deck for 
each S. 

The Q deck. The deck used in this study was de- 
veloped by Block (1954). It contained 115 items to 
be distributed over a forced nine-point distribution. 
Two aspects of item variability were examined. These 
were the size of the variance when an item was used 
as descriptive of a group, and the relation of items 
showing a significant difference among group means. 

The interviewers. The interviewers were five gradu- 
ate students in psychology or social work, all of 
whom had had at least three years of clinical ex- 
perience.” 

Stereotype sorts. Following the final interview and 
Q sort, the interviewers were instructed as follows: 


Using your experience with the deck for this sam- 
ple of cases, sort the Q deck for the five concepts 
listed below: (a) social desirability of the traits in 


2 The interviewers in this investigation were Doug- 
les Freed, Arthur Gallese, Ronald Johnson, A. M. 
Marchionne, and Jay Willet. 
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TABLE 1 


MEAN INTERCORRELATIONS FOR EACH STEREOTYPE 
AMONG Five Sorters 


Meanr 


Stereotype 


_32** 
is” 
.63** 
.76** 


Delinquent 
Nondelinquent 
Neurotic 

Social desirability 
Item difficulty 


< 


the deck; (b) difficulty of sorting each item in the 
deck; (c) impression you formed of the typical 
delinquent; (d) impression you formed of the 
typical nondelinquent; (¢) impression you formed 
of the typical neurotic. 


RESULTS 


The mean Pearsonian correlation coefficients 
among sorters for each of the five sorts were 
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calculated using Fisher’s z transformation. 
They are given in Table 1. The concept of de- 
linquency was poorly agreed upon (r = .32). 
For this reason it was dropped from subse- 
quent analyses even though there was some 
reliable covariance. The agreement on diffi- 
culty in sorting was not encouragingly, high. 
The remaining values appeared to be re- 
spectable interjudge correlations. 

Mean item ratings for the four samples and 
the stereotypes were intercorrelated. These re- 
sults are given in Table 2. The matrix of in- 
tercorrelations in Table 2 has three subdi- 
visions: the interstereotype correlations, the 
intersample correlations, and correlations be- 
tween the stereotype sorts versus the mean 
sample sorts. 

The four samples were compared with each 
other in their relationship to the three stereo- 


TABLE 2 


CORRELATION COEFFICIENTS BETWEEN SAMPLE AND STEREOTYPE ITEM MEANS 


Neurotic 
Non- 
delin- 
quent 


Non- 
delin- 
quent 


Delin- 
quent 


Sample or 
Stereotype 
Neurotic 
nondelinquent 
sample 


2** 72** 


Neurotic 
delinquent 
sample 


Character 
disordered 
nondelinquent 
sample 


Character 
disordered 
delinquent 
sample 


Nondelinquent 
stereotype 


Neurotic 
stereotype 


Social * 
desirability 
stereotype 


*P < OS. 
*P < 01. 


Character Disordered 


Stereotype 


Item 
Diffi- 
culty 


Social 
Desira- 
bility 


Non- 
delin- 
quent 


Delin- 
quent Neurotic 


A1** 
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TABLE 3 


SIGNIFICANT SIMILARITIES BETWEEN STEREOTYPES AND SAMPLES 


Stereotype 


Nondelinquent Neurotic 


Social Desirability 


Samples Compared 


Neurotic nondelinquent (TI) vs. 
neurotic delinquent (IT) 


Neurotic nondelinquent (I) vs. 
character-disordered nondelinquent (IIT) 


Neurotic delinquent (I) vs. 
character disordered delinquent (IV) 


Neurotic delinquent (IT) vs. 
character-disordered nondelinquent (IIT) 


Neurotic delinquent (II) vs. 
character-disordered delinquent (IV 


Character-disordered nondelinquent (IIT) vs. 
character-disordered delinquent (IV) 


Like Sample Like Sample Like Sample 


I** 


[** 


< 01. 
.001. 


types having the highest interrater agreement: 
nondelinquency, neuroticism, and social desir- 
ability. These results are shown in Table 3. 
The validity of these stereotypes was then as- 
sessed in terms of known characteristics of 
the samples. 

Nondelinquent stereotype and known non- 
delinquency. The samples of nondelinquents 
were accurately described as less nondelin- 
quent than the delinquent samples with simi- 
lar personality test patterns. The nondelin- 
quent neurotic sample appeared about the 
same as the character disordered delinquent 
sample. In this case the stereotype did not 
distinguish the known group behaviors. How- 
ever, on the other cross-personality pattern 
comparison a significant difference was found, 
showing the nondelinquent character disor- 
dered group to be similar to the nondelinquent 
stereotype and the delinquent neurotic code 
group correctly was not described as nonde- 
linquent. The surprising thing in these find- 
ings is the degree to which the character-dis- 
ordered samples appeared more nondelinquent 
than the neurotic samples. Other data show 
that, in fact, the character-disordered sam- 
ples consisted of individuals with a lower re- 


gard for the mores than most individuals with 
test findings indicating neurotic disturbance. 
The nondelinquent stereotype apparently was 
complicated by other influences. 

Neurotic stereotype and personality test 
findings. The results for the neurotic stereo- 
type group comparisons also appear in Table 3. 
Here the neurotic and the character disor- 
dered groups were distinguished on all occa- 
sions. It is interesting to note that the delin- 
quent neurotic sample was described as more 
neurotic than the nondelinquent sample with 
similar MMPI profiles. This suggests a sur- 
plus meaning in the stereotype. It should be 
noted that, in fact, the delinquent samples 
were actually more psychopathic on the 
MMPI and in actual behavior than their 
nondelinquent test pairs (Wirt & Briggs, 
1959). 

Social desirability stereotype and other cri- 
teria of social desirability. As indicated previ- 
ously: on the criteria of social desirability, 
school and job achievement, and constructive 
community enterprise, the neurotic nondelin- 
quent sample was superior to all other sam- 
ples. According to the stereotype developed 
by the raters, however, it was the character- 
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TABLE 4 


THe NuMBER OF ITEMS JUDGED CHARACTERISTIC OF THE 
SAMPLES AND STEREOTYPES WHICH OVER- 
LAP THE EXTREMES OF SOCIAL 
DESIRABILITY 


Items 
Socially Socially 
Desirable Undesirable 
(N = 19) (N = 20) 
Sample or 
Stereotype Like Unlike Like Unlike 
Neurotic 
nondelinquent 6 0 0 11 
sample 
Neurotic 
delinquent 9 0 0 6 
sample 
Character 
disordered 
nondelinquent 10 0 0 11 
sample 
Character 
disordered 
delinquent 9 0 0 7 
sample 
Nondelinquent 
stereotype 12 0 0 13 
Neurotic 
stereotype 0 5 4 4 
Item difficulty 
stereotype 0 4 3 + 


disordered nondelinquent sample that seemed 
most desirable. Further, known delinquency 
versus nondelinquency was not reflected in 
social desirability as sorted by the raters, al- 
though certainly delinquency is less socially 
desirable than nondelinquency. The relation- 
ship between the nondelinquent and social de- 
sirability stereotypes was the highest in the 
matrix (7 = .86). 

The MMPI and delinquency status were 
effective variables, as were the nondelinquent 
and the social desirability stereotypes. All of 
these correlated with the other samples and 
stereotypes in ways which tend to reflect the 
- known group characteristics. 

The high correlations associated with social 
desirability as an a priori stereotype accounts 
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for a quite uniform amount of variance in all 
samples except the sample of nondelinquent 
character disorder individuals. This sample 
approached the stereotype of “good” person- 
ality to a striking degree. It is important to 
note that this finding is not paralleled in 
other, less subjective, data where it was found 
that these individuals would not be consid- 
ered socially desirable by conventional stand- 
ards (they are unachieving, promiscuous, un- 
reliable, etc.). 

Removal of the variance attributable to so- 
cial desirability would not appreciably alter 
the relationships in the sample matrix. An 
illustration of the overlap among all of the 
distributions underlying the matrix and the 
social desirability stereotype is provided in 
Table 4. Here the top and bottom 20 items 
in each sort were examined for overlap with 
the top 19 and bottom 20 of the social de- 
sirability items. It will be noted that in the 
nondelinquent stereotype and in the clinical 
groups there were no misplaced items. The 
correlations shown in Table 2 are, of course, 
a function of the frequency of these over- 
lapping items. However, the concreteness of 
the examples in Table 4 provides a convincing 
illustration of the strength of the correlation. 

The content of items which constituted the 
more desirable traits were intellectuality, emo- 
tionality, independence, and perspective. The 
undesirable traits seemed much more compli- 
cated and contained longer, more involved de- 
scriptions. The traits rated undesirable were 
highly interpretive and referred to intrapsy- 
chic phenomena; e.g., hostility, absence of 
warmth, and poor ego development. The ends 
of the distribution for the nondelinquent 
stereotype sorts were much the same as those 
for the social desirability items. 

The items from the extremes of the neurotic 
stereotype described insecurity and preoccu- 
pation with neurotic problems at the charac- 
teristic end, while at the nonneurotic extreme 
the items reflected an absence of concern with 
such thoughts and a happy aggressive mastery 
of situations. Again, the content of the nega- 
tive (more neurotic) items was more compli- 
cated and the descriptions were longer than 
the items sorted as more desirable. 

Content of items in relation to item diffi- 
culty. An examination of the traits that were 
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regarded as difficult or easy to sort showed 
the more difficult items seemed to refer to 
quite obscure facts, usually relating to covert 
data or complex interpersonal relations. The 
items rated as easily sorted, by contrast, re- 
ferred to more observable ways of acting or 
simply to expressed verbal opinions. The fre- 
quencies of overlap between the extremes of 
this distribution and the social desirability 
dimension extremes were presented in Table 4. 
The correlation between these two mean sorts 
was low (r = .33) but significant (P < .01). 
The small number of overlapping items shown 
in Table 4 reflects the weak relationship. 

Item variance and mean differences. A final 
pair of measures of item performance studied 
were the item variances within the samples 
and the occurrence of significant mean differ- 
ences among the sample means. From the 115 
items, 24 with low variance were selected 
(items with two sample variances out of 
four below 3.00). Seventeen high variance 
items were selected (items with two vari- 
ances greater than 5.50). The proportion of 
discriminating items in these two sets was 
almost identical. There were six items mak- 
ing 10 discriminations in the low set and five 
items making 9 discriminations in the large 
variance set. The items for the extremes of 
social desirability were not evenly represented 
among the items at the extremes of variabil- 
ity. There were four such items in the 24- 
item set with low item variability and 10 in 
the 17-item set with high variability (P< 
.01). Of these latter 10 items, five were from 
each end of the social desirability continuum. 
It was clear that the items with large vari- 
ances were more often from the extreme of 
social desirability than were items with small 
variance. As can be seen in Table 4, the two 
extremes of the social desirability stereotype 
behaved consistently (although the items 
were different). 

The ease-difficulty of sorting was examined 
for the items in the two extremes of variation. 
There were 6 items of high variance and 11 
with low variance. The easy to sort items 
were all high variance items (5 high and 0 
low), while the hard to sort items were almost 
all low variance items (1 high to 11 low; 
P< .01). Larger variances were associated 
with easier sorting (and with social desir- 


ability). Items in the extremes of sorting 
ease were, however, almost equally discrimi- 
nating in group comparisons. The number of 
items and discriminations made by the items 
in each group mean comparison for the ex- 
treme items showed no differences. Neither 
the nondelinquent nor the neurotic stereotype 
extreme items behaved differently with respect 
to variance size. However, the high variance 
items were more frequent in the nondelinquent 
stereotype extremes than were the low vari- 
ance items. This would be expected from the 
relationship between the nondelinquent and 
social desirability stereotypes. Neither set of 
extreme items was differently involved in 
group differences. 


DISCUSSION 


These data support the argument that 
stereotypes can be formed from Q deck items. 
This evidence does not, however, substantiate 
the idea that the stereotypes derived from a 
priori sorts are especially meaningful. This 
is particularly true of the social desirability 
concept. In the forced sort the content avail- 
able in the items as subtopics determines the 
form a stereotype may take. 

With this particular set of items a type of 
glib, almost psychopathic caricature emerges 
as the stereotype of the socially desirable per- 
sonality. It is quite conceivable that with 
other item sets quite a different stereotype 
might have emerged. To some degree, the 
items in this stereotype showed a high over- 
lap with all the sample descriptions. It is clear 
that the desirability continuum describes the 
nature of any group to the extent that no 
items from the undesirable extreme are re- 
versed in group description. Thus the ex- 
tremes of such a stereotype are apparently 
forced by the sorters’ knowledge of people. 
This is certainly not the typical meaning of 
a stereotype in psychological research. It more 
closely parallels the meaning of a cliché. This 
is not to imply that the items at the extremes 
of the stereotypes do not discriminate be- 
tween cases, for they manifest more than av- 
erage variability in the present study. 

Among the items in the deck some connec- 
tion between sorting ease and the stereotype 
extremes appeared clearly. Here it is sug- 
gested that stereotyping hinges upon base- 
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belief relationships. The easily sorted items 
were useful in stereotyping while more ob- 
scure factors simply did not enter. Similarly 
the easier to sort items were more variable. 

As to the intradeck relationships and the 
items describing groups, there were no obvious 
limitations or correlations. The distinguishing 
characteristics of the groups seemed to emerge 
from all types of items. 

A final point is in order. Edwards (1957) 
and others have proposed that items reflect- 
ing stereotypes be removed or controlled. This 
is a difficult procedure, since to identify the 
particular influences to be removed requires 
a very extensive analysis of the deck. But 
more importantly, it appears that valuable 
items will certainly be deleted under such a 
system of selection. 


SUMMARY 


Q sort descriptions of several samples and 
stereotypes were made by five judges follow- 
ing intensive interviews. These descriptions 
were compared with known characteristics of 
the Ss. The mean correlation among sorters 
for each of the five sorts of delinquency, non- 
delinquency, neuroticism, social desirability, 
and item difficulty was significant. The sorters 
accurately described the nondelinquents and 
the neurotic samples. They described the de- 
linquents as more neurotic than the nonde- 
linquents and the character disorder samples 
as more nondelinquent than the neurotic sam- 
ples. In fact other data showed the character 
disorder samples to be more psychopathic on 
the MMPI and in actual behavior than other 
groups. The raters described the character- 
disordered nondelinquents as the most socially 
desirable, while in fact this group was known 


to have a generally poor regard for social con- 

ventions. Known delinquency versus nonde- 

linquency was not reflected in social desir- 
ability as sorted by the raters. 

Items which were difficult to sort included 
content requiring interpretation of covert be- 
havior, while items rated as easily sorted re- 
ferred to more observable ways of acting. 
Items with large variance within samples were 
more often from the extremes of social desir- 
ability than were items with small variance. 

The data were interpreted to show that 
while the Q sort technique may be useful in 
developing definitions for stereotypes, the re- 
sulting concepts must be interpreted with 
caution. This seemed especially true of the 
social desirability stereotype. It was suggested 
that the removal or statistical control of items 
reflecting stereotypes requires extensive analy- 
sis and can result in deletion of valuable in- 
formation. 
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Of the various instruments employed to 
elicit fantasy responses, the Rorschach has 
been both the most frequently used technique 
and the one subjected to the most empirical 
research. However, despite many years of re- 
search effort, a large body of social scientists 
still question Rorschach’s basic hypotheses 
and the reliability and validity of Rorschach 
derived variables. The purpose of this paper 
is to demonstrate that selected Rorschach 
variables show low, but statistically signifi- 
cant, degrees of intra-individual stability over 
periods of three and six years during adoles- 
cence and early adulthood. Since important 
personality changes often occur during these 
years one would not expect extremely high re- 
liability coefficients over such a long period 
of time. Thus, absence of stability for a spe- 
cific variable would not mean that the vari- 
able might not be reliable for shorter test- 
retest intervals. However, demonstration of 
a Statistically significant degree of stability, 
albeit low to moderate, would suggest that the 
variable was reflecting an enduring aspect of 
human functioning and deserved more inten- 
sive study. 

There is some indication that determinant 
variables show low to moderate degrees of 
test-retest reliability when the interval be- 
tween tests is less than a week (Kaplan & 
Berger, 1956; Epstein, Nelson, & Tanofsky, 
1957). In the Epstein et al. (1957) research 
16 college students were presented with a dif- 
ferent series of specially designed ink blots 
twice weekly for five weeks. Kaplan and 
Berger administered the standard Rorschach 
stimuli four times to 28 subjects with an in- 


1 This research was supported, in part, by a re- 
search grant (M-1260) from the National Institute 
of Mental Health, United States Public Health 
Service 


terval of four days between each administra- 
tion. In both studies movement responses 
showed a moderate degree of reliability with 
correlations in the forties and fifties. How- 
ever, except for a longitudinal study of the 
consistency with which the 10 standard blots 
elicit specific responses (Ranzoni, Grant, & 
Ives, 1950), there is no evidence on the long 
term intra-individual stability of content or 
determinant variables. 

Historically, initial interest in Rorschach 
responses was centered on the formal, percep- 
tual dimensions of location and determinant. 
However, during the last decade there has 
been a rapidly growing interest in the moti- 
vational content of Rorschach responses, with 
particular emphasis on aggressive content 
(DeVos, 1952; Elizur, 1949; Finney, 1955; 
Kagan, Sontag, Baker, & Nelson, 1958; 
Rader, 1957; Schafer, 1954; Towbin, 1955; 
Wolf, 1957; Zubin, Eron, & Sultan, 1956). 
The present study, therefore, was particu- 
larly interested in the stability of aggressive 
content. The intense interest in aggressive 
content seems due, in part, to the fact that 
Rorschach stimuli have a strong tendency to 
elicit content which appears aggressive in 
meaning, e.g., dead bug, animals fighting, 
people arguing, knives, guns, bombs explod- 
ing, etc. Undisguised evidence of other dy- 
namic motivational percepts (sex or depend- 
ency) appears with less frequency. That is, 
perceptions of objects or actions symbolic of 
sexual or dependent motives (genitals, breasts, 
dogs nursing, people kissing) are less fre- 
quent than imagery normally regarded as sym- 
bolic of aggression. Second, clinicians often 
must make important predictions about a pa- 
tient’s aggressive motives and the probability 
of occurrence of overt aggressive behavior. 
This practical consideration would direct the 
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clinician’s attention to imagery which ap- 
peared to be symbolic of aggressive thoughts. 

In addition to aggressive content, the sta- 
bility of seven additional variables was ex- 
plored. The complete list of variables in- 
cluded: (a) number of responses (R), (5) 
dynamic aggression (Dyn. Agg.), (c) static 
aggression (Stat. Agg.), (d) anatomy (Anat.), 
(e) dependent-oral imagery (Dep.-Oral), (f) 
human and human detail responses (H), (g) 
human movement (M), (A) animal move- 
ment (FM), and (i) sum movement (sum M, 
FM, m). 

The stability of movement responses was 
investigated because aggressive content often 
involves a movement dimension and because 
many psychologists regard the movement re- 
sponse as one of the most important Ror- 
schach variables (Klopfer, Ainsworth, Klop- 
fer, & Holt, 1954; Phillips & Smith, 1953; 
Piotrowski, 1958). Human percepts (H) were 
analyzed in order to determine the differential 
stability of human percepts with movement 
versus human percepts with or without the 
presence of movement. Anatomy was scored 
because of its presumed diagnostic value for 
pathological conditions. Finally, dependent- 
oral imagery was scored in order to compare 
the stability of two motive categories (aggres- 
sion and dependency) that play an important 
role in the functioning of both normal and 
patient populations. Although we would have 
liked to score for location, form quality, color, 
shading, and vista, the nature of the avail- 
able data made this analysis impossible. The 
protocols were gathered over a long period of 
time by different examiners with varied in- 
quiry procedures and it was impossible to 
score for location and most determinants. 


METHOD 
Subjects and Procedure 


The subjects (Ss) were drawn from the Fels re- 
search population, which is a predominantly middle- 
class sample residing in southwestern Ohio. The fa- 
thers of these Ss were equally distributed among the 
following vocational classes: agriculture, skilled la- 
bor, tradesmen, and professional. The IQ scores of 
the Ss were all average or above with a mean Stan- 
ford-Binet IQ of 120 during the childhood years, 
and a mean Wechsler-Bellevue IQ of 123 during 
early adult life. For this study, 37 male and 38 fe- 
male Ss were selected for whom two or three stand- 
ard Rorschach protocols were available between 10 
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and 20 years of age. In addition, 52 of these Ss (30 
males and 22 females) were adult members of a cur- 
rent assessment program which has been in progress 
for several years. The ages of the 52 Ss at the time 
of the adult assessment ranged from 19 to 29 with 
a median age of 25 years. 

The longitudinal Rorschach protocols were based 
on the 10 standard Rorschach plates which were ad- 
ministered at medium ages of 104, 134, and 164, with 
30 of the males and 33 of the females having all 
three administrations. As mentioned above, the ex- 
aminers differed over the years, and the comparabil- 
ity with respect to the inquiry could not be guaran- 
teed. For this reason, only the initial, spontaneous 
verbalization of the S (the free association) was 
scored. In the adult assessment program a modifica- 
tion of the Rorschach stimuli was used which will 
be called the modified Rorschach. In this modified 
series, parts of the standard stimuli were covered 
with a template of white cardboard. Thus the S saw 
only a part of the original stimulus and gave only 
one response to each card. The 32 stimuli were com- 
mon D or d areas, each tended to be perceived as 
a unitary Gestalt, and, with one exception, all the 
stimuli were either completely achromatic or chro- 
matic. This special instrument was designed for two 
reasons. First, it guaranteed an equal and fairly 
large response pool for all Ss. The statistical prob- 
lems associated with unequal protocol length have 
long plagued Rorschach investigators, and this in- 
strument was a beginning attempt to solve this 
thorny problem. Second, this instrument gives the 
investigator considerable control over the stimulus 
input to the S and allows the investigator to specify 
those aspects of the stimulus upon which the re- 
sponse was based. It is acknowledged that this al- 
teration of the Rorschach stimuli produces a qualita- 
tively different stimulus pattern to the S. However, 
the author was interested in ascertaining if specific 
types of response would show intra-individual sta- 
bility despite alteration in the external stimulus. The 
32 stimuli in this series were congruent with the fol- 
lowing Klopfer areas (Klopfer et al., 1954, pp. 70- 
79) .? 


Rorschach 
Plate Klopfer Areas 
I (D 1) (D2) (d 3 symmetrical) 
II (D 1) (D 2 symmetrical) (D3 symmet- 
rical) (d 1) 
Ill (D 2) (D1) (D 3) (D 8 symmetrical 
plus D 3) 
IV (D 1) (d 2) (whole blot minus D 1) 
V (d 2) (whole blot minus d 3 and d 1) 
VI (d 2) (D4) (d4) 
Vil (D 4 symmetrical) (d 1 
VIII (D 2) (D1 plus D 3 plus D 4) 
(D 3 the center) (D 4 the bottom) 
IX (D 2 symmetrical) (D 5) (D 7) 
xX (D 8) (D 17) (D 3) (D 1) 


2 The author has available for dissemination photo- 
static reproductions of the 32 stimuli. 
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Scoring 


The standard and modified Rorschach responses 
were scored in the same manner for the following 
content categories. 

Dynamic aggression. This category included fe- 
sponses which involved the ascription of aggressive- 
destructive activity or a state of death or injury to 
animals, objects, or people. The category consisted 
of three subcategories: (a) aggression—animals or 
people fighting, arguing, angry, or in a belligerent or 
aggressive posture, eg., lions fighting, two ladies 
arguing, an octopus about to snatch a fish, a panther 
about to leap on its prey; (b) destruction—explo- 
sions or destructive action involving inanimate ob- 
jects, eg., bombs exploding, volcano erupting, A- 
bomb cloud, thrust of a rocket, landslide; the re- 
sponses in this destruction category all involved 
destructive action (inanimate movement) and the re- 
sponse “rocket” or “bomb” alone was not included 
in this category; (c) injury—any animal, person, or 
object seen as dead, wounded, mutilated, or dam- 
aged, e.g., dead dog, squashed bug. This latter cate- 
gory did not occur too frequently and the first two 
categories (aggression and destruction) made up 75% 
of the dynamic aggression score. 

Static aggression. This category included percepts 
of objects, animals, or animal parts which are nor- 
mally regarded as aggressive but which did not in- 
volve the ascription of aggressive or destructive mo- 
tivation or activity to the perceived object. This 
category included the following subcategories: (a) 
aggressive objects—objects or animal parts normally 
regarded as aggressive, eg. bullet, bomb, volcano, 
rocket, knife, teeth, claws, horns; (6) aggressive ani- 
mals—untamed animals normally regarded as aggres- 
sive, e.g., lions, tigers, panthers, bears, vampires; (c) 
monsters—creatures of myth or fantasy commonly 
regarded as aggressive, eg., monster, giant, witch. 
Thus, the response “bomb” or “monster” was scored 
as static aggression, while the response “bomb ex- 
ploding” or “monsters fighting” was scored dynamic 
aggression. Responses in which aggressive animals 
were engaged in nonaggressive activity were also 
scored static aggression, eg., tigers walking, lions 
climbing, etc. 

Dynamic and static aggression differ from each 
other in the degree to which the aggressive sym- 
bolism is disguised or attenuated. The dynamic ag- 
gressive response always involves undisguised aggres- 
sive action or the end result of aggressive-destruc- 
tive activity. The aggressive implications of the 
static aggressive response are more disguised and 
symbolic. That is, the responses “knife” or “tiger” 
are less obviously aggressive in meaning than the 
responses “knife tearing flesh” or “tiger devouring 
a bird.” This variable of “degree of disguise” was 
the primary basis upon which these two aggressive 
categories were separated 

Anatomy. Images of the internal body parts of 
animals or people or X rays of these internal body 
parts were scored anatomy, eg., stomach, spinal 
column, X ray of the chest, lungs, kidney. 


Dependence-orality. This category included im- 
agery which many clinicians normally regard as in- 
dicative of dependency motivation. This variable in- 
cluded the following two subcategories: (a) passive- 
dependent activity—animals or people in a passive 
posture or in a dependent activity; eg., people lying 
down, people resting or sleeping, pigs suckling, dogs 
nursing, people supporting each other; (b) oral con- 
tent—responses in which people or animals were eat- 
ing or drinking and responses describing food or ob- 
jects directly associated with food or eating, eg., 
people eating, people drinking, ice cream, fried 
chicken, people sitting in a restaurant, side of beef, 
pitcher of milk. 

Human or human detail. This category included 
all images of humans or parts of humans, e.g., per- 
son, man sitting, face of someone, child, hands, etc. 

Movement. All the Rorschach responses were coded 
separately for the standard categories of M, FM, 
and m as described by Klopfer et al. (1954). The 
present scoring of m includes Klopfer’s Fm, mF, 
and m. A total score (sum M, FM, m) was obtained 
by summing the three movement scores. 

Response length. The variable R referred to the 
number of spontaneous responses in each protocol. 

The interater reliability for the scoring of the first 
six categories was satisfactory for the over-all per- 
centage of agreement between the author and an in- 
dependent scorer was 92%. 


RESULTS 
Age and Sex Differences 


Table 1 shows the frequency of occurrence 
of the variables for the three standard Ror- 
schach protocols as well as the modified Ror- 
schach for males and females separately. 

There were no statistically significant age 
differences for the movement or content vari- 
ables for the three standard Rorschach proto- 
cols. Occurrence of sum M, FM, m showed a 
moderate but nonsignificant increase with age 
for both sexes. The increase in response length 
from Protocol 1 to Protocol 3 was significant 
for males only (p< .01; two tails). Since 
the modified Rorschach elicited more re- 
sponses than the standard plates and involved 
different stimuli, the differences in response 
frequencies between Protocol 3 and the modi- 
fied Rorschach cannot be attributed solely to 
age changes. 

None of the variables produced significant 
sex differences for all administrations. On 
Protocol 3, the males had a larger sum M, 
FM, m score (p < .05; two tails) but this 
was due, in part, to the fact that the males 
produced longer protocols than the females. 
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TABLE 1 


PERCENTAGE OF Ss REPORTING ONE OR MorE OF EACH RESPONSE CATEGORY AND THE 
MEDIAN NuMBER OF RESPONSES FOR EACH PROTOCOL 


Kagan 


Protocol 1 Protocol 2 


Protocol 3 Mod. Rorschach 


Category 


54.1 39.5 


Dyn. Age. 45.9 42.1 10 0.0 2.0 
Stat. Age. 81.1 86.8 2.0 2.0 81.1 86.8 2.0 
Anatomy 37.8 42.1 0.0 0.0 48.6 28.9 0.0 
Dep-Oral 37.8 42.1 0.0 0.0 37.8 44.7 0.0 
H 83.8 78.9 2.0 2.0 75.7 94.7 2.0 
M 59.5 63.2 10 1.0 43.2 63.2 0.0 
FM 64.9 52.6 10 1.0 62.2 57.9 1.0 
m 21.6 36.8 0.0 0.0 35.1 13.2 0.0 
Sum M,FM,m 784 78.9 20 2.0 67.6 71.1 2.0 
R 18.0 20.0 21.0 


0.0 53.3 51.5 86.7 72.7 3.0 1.0 
2.0 90.0 93.9 3.0 2.0 96.7 100.0 30 40 
0.0 53.3 48.5 1.0 6.0 66.7 SOA 10 1.0 
0.0 53.3 42.4 1.0 0.0 63.3 72.7 1.0 1.0 
3.0 96.8 93.9 40 3.0 96.7 100.0 45 4.0 
1.0 76.7 57.6 1.5 1.0 86.7 72.7 20 2.0 
1.0 73.3 72.7 2.0 1.0 100.0 90.9 40 4.0 
0.0 30.0 30.3 0.0 0.0 73.3 63.6 10 10 
2.0 86.7 84.8 5.0 3.0 100.0 95.5 7.0 7.0 
21.0 30.0 23.0 32.0 32.0 


The only other significant sex difference was 
obtained on the modified Rorschach with the 
males producing significantly more dynamic 
aggression (p < .05; two tails). This result 
is difficult to interpret since it did not appear 
on the earlier protocols. It is possible that the 
modified Rorschach, by eliciting a larger re- 
sponse pool, may have facilitated the emer- 
gence of latent sex differences on this vari- 
able. The absence of consistent sex differences 
for these variables is in essential agreement 
with other research. Ranzoni, Grant, and Ives 
(1950), in a longitudinal study of adoles- 
cents, did not find important sex differences 
in type of response to each of the 10 stimuli. 
Stavrianos (1942) and Ames, Learned, Me- 
traux, and Walker (1952) did not find sig- 
nificant sex differences in movement responses 
with 10-year-old children. In a more recent 
study, Baughman and Guskin (1958) failed 
to find any stable sex differences among 17 
Rorschach scores with a group of “normal” 
adults. 


Stability 


Since the median frequency of occurrence’ 
of each of these response classes was low, the 
stability of occurrence was assessed by the 
phi coefficient. For each protocol, the distri- 
bution for each variable was split at the me- 
dian and chi square and phi coefficients were 
computed (with Yates’ correction for conti- 
nuity) from these 2 X 2 distributions. In 
cases where more than one S fell at the me- 
dian, the distribution was divided so that the 


proportion of cases in each of the two groups 
was as Close as possible to a 50-50 split. A 
rank order correlation was computed to assess 
the stability of response length (R). Inani- 
mate movement (m) was of such infrequent 
occurrence that stability coefficients are not 
presented for this category. The stability co- 
efficients for the modified Rorschach involved 
a comparison of each S’s responses to the last 
standard protocol with his responses to the 
modified Rorschach. 

Table 2 presents the stability coefficients 
for these variables for four interprotocol com- 
parisons for males and females separately. 

Although none of the coefficients would be 
considered high, about one third of them were 
statistically significant. Of the five content 
categories, only dynamic aggression and anat- 
omy showed significant stability coefficients 
for both sexes, with the highest coefficients 
obtained for the Protocol 2 and 3 compari- 
son. The occurrence of dynamic aggression on 
Protocol 3 was correlated with its presence on 
the modified Rorschach for males but not for 
females. 

For the males, the human movement score 
showed a significant degree of stability over 
the first three protocols (ages 103 to 1634), 
and human movement was much more stable 
than animal movement. Of the 12 stability 
coefficients involving the three movement vari- 
ables (M, FM, and sum M, FM, m) 58% 
were statistically significant for males and 
25% for females. Although human movement 
responses were moderately stable, the cate- 
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gory of human percepts (H) showed no evi- 
dence of stability. This suggests that the as- 
cription of action or motivation to a human 
percept is of psychological significance. The 
comparison involving Protocol 3 and the 
modified Rorschach yielded only three sig- 
nificant coefficients. Since the modified stimuli 
differed from those of the standard Rorschach 
it is impossible to determine whether age 
and/or the alteration in stimuli led to the 
decreased number of stability coefficients as 
compared with the Protocol 2—3 comparison. 

Protocol length showed fairly consistent sta- 
bility for both sexes, and thus “productivity” 
appears to be a moderately stable behavioral 
tendency. For the content categories, the com- 
parisons involving Protocols 2—3 produced 
more significant stability coefficients than the 
comparisons between Protocols 1-2. The com- 
parison between Protocols 1-3 produced the 
least number of significant coefficients. These 
results tentatively indicate that the stability 
of content variables increases with age and 
decreases as the interprotocol interval in- 
creases. 

It should be noted that the stability of a 
category was not positively related to its fre- 
quency of occurrence. Static aggression, with 
minimal stability, was the most frequently 
occurring category, while dynamic aggression 
was of relatively infrequent occurrence. Phi 
coefficients relating each of the variables with 
response length were low and positive, but in 
no case were they statistically significant. 
These results compare with those of Fiske 


and Baughman (1953) who reported a com- 
plex and generally nonlinear relationship be- 
tween various Rorschach variables and re- 
sponse length. 

Because of the differential stability of dy- 
namic and static aggression the correlation 
between these two categories was obtained 
for the present group of males and females 
and for additional samples of 32 male and 22 
female adolescents. The phi coefficients were 
all close to zero (range of —.13 to +.03), in- 
dicating no association between these two 
variables. This zero order relationship held 
for both the standard Rorschach as well as 
for the modified Rorschach stimuli. Further- 
rore, an analysis was made of the relation- 
ship between occurrence of these two cate- 
gories and type of eliciting stimulus for the 
modified Rorschach. This task was impossible 
for the standard Rorschach plates because of 
the incomplete inquiry procedures. The results 
revealed that dynamic aggression occurred 
more often to chromatic than achromatic 
stimuli (p < .05; two tails), while static ag- 
gression was elicited more often by achro- 
matic cards (p < .05; two tails). Thus, static 
and dynamic aggression were (a) differen- 
tially stable over time, (4) statistically inde- 
pendent, and (c) associated with different 
kinds of external stimuli. 


DISCUSSION 


The results are certainly not as orderly as 
one would want, and the sizes of the stability 
coefficients are quite modest. Although this 
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research had no a priori hypotheses about the 
outcome of the analysis, the data suggest 
several tentative conclusions. One of the pur- 
poses of the study was to investigate the long 
term stability of aggressive content. The dif- 
ferential stability and statistical independ- 
ence of dynamic and static aggression indi- 
cate that these two variables are indices of 
different psychological processes. It is sug- 
gested, therefore, that future research with 
Rorschach aggressive content consider these 
two response classes separately. Research 
now in progress is directed at an explication 
of the psychological significance of the dy- 
namic aggressive response. 

As mentioned earlier, except for protocol 
length, movement responses showed the most 
consistent stability for the variables analyzed 
in this study. Seven of the 16 stability coeffi- 
cients involving M and FM were statistically 
significant at the .05 level or better, and hu- 
man movement had the highest average sta- 
bility coefficient for the four interprotocol 
comparisons (average phi = .40 for men and 
.29 for women). It is of interest to note that 
the investigations described earlier on the 
short term reliability of movement responses 
yielded coefficients that were not much higher 
than the ones reported here. Epstein et al. 
(1957) reported reliability coefficients for M, 
FM, and sum M, FM, m of .45, .49, and .61 
respectively (all significant at p< .01) for 
protocols separated by three days. Kaplan 
and Berger (1956) reported correlations of 
51 (p< .01) for M and .41 (p< .05) for 
FM +m with a four-day interval between 
tests. Thus the reliability coefficients for 
movement imagery with test retest intervals 
of several days did not differ markedly from 
the coefficients obtained in this study with an 
interval of several years. This finding indi- 
cates that the movement response, as now de- 
fined, is not reliable enough to be used for 
individual prediction. However, the present 
demonstration of stability over a three-year 
period for males and females and over a six- 
year period for males suggests that the hu- 
man movement response is a crude index of 
a relatively stable personality construct. 

Despite the fact that changes in values and 
motives occur during these years, it is gen- 
erally assumed that some ego processes do re- 
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main stable over time. Research and theory 
suggest that the human movement response 
is associated with an introspective attitude 
and an awareness of motives, conflicts, and 
sources of anxiety. The research of Affleck and 
Mednick (1959), Gibby, Stotsky, Miller, and 
Hiler (1953), Singer, Wilensky, and Mc- 
Craven (1956), and King (1958) indicate 
that movement responses are produced by in- 
dividuals who tend to be aware of their mo- 
tives, conflicts, and sources of anxiety and 
are willing to talk about these intrapsychic 
phenomena. These findings agree with the 
statements of Klopfer et al. (1954) that 
movement imagery reflects an availability to 
consciousness of thoughts that are directly 
related to basic motives and conflicts. 

Current personality theory relies heavily on 
constructs which describe motives, defenses, 
or styles of behavior. The human movement 
response presumably taps a style of thinking 
or preferred manner of dealing with a motive 
rather than a motivational predisposition like 
dependency or aggression. Since human move- 
ment showed greater evidence of stability than 
most of the present content categories, it is 
possible that preferred modes of thinking 
(cognitive styles) may be more resistant to 
change than the strength of motives and 
values. Theoretical refinement of the move- 
ment variable together with the construction 
of superior instruments should produce more 
acceptable levels of reliability and a clearer 
understanding of the meaning of this be- 
havior. 


SUMMARY 


This paper summarized data on the long 
term intra-individual stability of selected Ror- 
schach categories for three standard Ror- 
schach protocols and a modified set of Ror- 
schach stimuli. 

The Ss were 37 males and 38 females for 
whom standard Rorschach protocols were ad- 
ministered at median ages of 103, 13}, and 
16}. In addition, 52 of these Ss were adminis- 
tered 32 modified Rorschach stimuli at a me- 
dian age of 25 years. The modified stimuli 
were prepared from the standard plates and 
were congruent with common D and d areas. 
All protocols were scored for number of re- 
sponses, human, animal, and inanimate move- 
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ment, sum of all movement responses, dy- 
namic aggression, static aggression, anatomy, 
dependent-oral imagery, and human plus hu- 
man detail responses. 

The results revealed that: 

(a) For the standard Rorschach protocols, 
number of responses, dynamic aggression, 
anatomy, dependence-orality, human move- 
ment, and sum movement showed low but 
statistically significant degrees of stability 
during the ages 134 and 16} (phi coefficients 
ranged from .39 to .59). Static aggression and 
human percepts showed no evidence of sta- 
bility. 

(4) In comparing the last standard Ror- 
schach with the modified Rorschach stimuli, 
dynamic aggression and animal movement 
(for men) and human movement (for women) 
showed a significant degree of stability (phi 
coefficients of .60, .42, and .47 respectively). 

(c) Dynamic aggression, which usually in- 
volved aggressive movement and contained 
undisguised aggressive content, was statisti- 
cally independent of static aggression, which 
did not involve aggressive action and con- 
tained more disguised aggressive symbolism. 
In addition, dynamic aggression occurred 
more frequently to chromatic cards while 
static aggression was elicited more often by 
achromatic stimuli. It was suggested that fu- 
ture scoring codes for aggressive content dif- 
ferentiate between dynamic and static aggres- 
sive responses. 

(d) Although the stability coefficients for 
human movement responses were low, this 
variable showed the most consistent stability 
during adolescence for the males. It was sug- 
gested that human movement responses re- 
flected an “awareness of motives and con- 
flicts” and that this construct appeared to be 
a relatively stable personality variable. 


REFERENCES 

Arrieck, D. C., & Mepnicx, S. A. The use of the 
Rorschach test in the prediction of the abrupt 
terminator in individual psychotherapy. J. consult. 
Psychol., 1959, 23, 125-128. 

Ames, Louise B., Learnep, Janet, Metraux, 
W., & Wacker, R. N. Child Rorschach responses. 
New York: Hoeber, 1952. 

BaucuMan, E. E., & Gusxrn, S. Sex differences on 
the Rorschach. J. consult. Psychol., 1958, 22, 400- 
401. 


DeVos, G. A quantitative approach to affective sym- 
bolism in Rorschach responses. J. proj. Tech., 
1952, 16, 133-150. 

Exizur, A. Content analysis of the Rorschach with 
regard to anxiety and hostility. Ror. Res. Exch., 
1949, 13, 247-284. 

Epstetn, S., Netson, Jane V., & Tanorsxy, R. Re- 
sponses to inkblots as measures of individual dif- 
ferences. J. consult. Psychol., 1957, 21, 211-215. 

Finney, B. C. Rorschach test correlates of assaul- 
tive behavior. J. proj. Tech., 1955, 19, 6-16. 

Fiske, D. W., & Baucuman, E. E. Relationships be- 
tween Rorschach scoring categories and the total 
number of responses. J. abnorm. soc. Psychol, 
1953, 48, 25-32. 

Gissy, R. G., Stotsxy, B. A., Mirrer, D. R., & 
Hirer, E. W. Prediction of duration of therapy 
from the Rorschach test. J. consult. Psychol. 
1953, 17, 348-354. 

Kacan, J., Sontac, L. W., Baker, C. T., & Netson, 
Viecrinia L. Personality and IQ change. J. abnorm. 
soc. Psychol., 1958, 56, 261-266. 

Kaptan, B., & Bercer, S. Increments and consistency 
of performance in four repeated Rorschach ad- 
ministrations. J. proj. Tech., 1956, 20, 304-309. 

Kino, G. F. A theoretical and experimental consid- 
eration of the Rorschach human movement re- 
sponse. Psychol. Monogr., 1958, 72 (5, Whole No. 
458). 

Kioprer, B., Aivswortn, Mary D., Kioprer, W. G., 
& Hott, R. R. Developments in the Rorschach 
technique. Vol. 1. New York: World Book, 1954. 

Puitups, L., & Smirn, J. G. Rorschach interpreta- 
tion: Advanced technique. New York: Grune & 
Stratton, 1953. 

Protrowsk!, Z. Perceptanalysis. New York: Mac- 
millan, 1958. 

Raver, G. E. The prediction of overt aggressive 
verbal behavior from Rorschach content. J. proj. 
Tech., 1957, 21, 294-306. 

Ranzoni, Jane H., Grant, Marcverttre Q., & Ives, 
Vircinta. Rorschach card pull in a normal ado- 
lescent population. J. proj. Tech., 1950, 14, 107- 
133. 

Scuarer, R. Psychoanalytic interpretation in Ror- 
schach testing: Theory and application. New York: 
Grune & Stratton, 1954. 

Srncer, J. C., Wmensxy, H., & McCraven, V. G. 
Delaying capacity, fantasy, and planning ability: 
A factorial study of some basic ego functions. J. 
consult. Psychol., 1956, 20, 375-383. 

Stavrianos, Bertua. An investigation of sex differ- 
ences in children as revealed by the Rorschach 
method. Ror. Res. Exch., 1942, 6, 168-175. 

Towsrn, A. P. Hostility in Rorschach content and 
overt aggressive behavior. Unpublished doctoral 
dissertation, Yale Univer., 1955. 

Wo tr, I. Hostile acting out and Rorschach test con- 
tent. J. proj. Tech., 1957, 21, 414-419. 

Zustn, J., Eron, L. D., & Suttan, Frorence. A psy- 
chiatric evaluation of the Rorschach experiment. 
Amer. J. Orthopsychiat., 1956, 26, 773-782. 


(Received February 11, 1959) 


— 


i We 
2 
iis 
ri 
Sa a 
j 
x 
¢ 
A } 
a 


Journal of Consulting Psychology 
Vol. 24, No. 1, 1960 ‘ 


There are essentially two main points of 
view regarding the factor structure of the 
MMPI (Hathaway & McKinley, 1951). On 
the one hand, there are those like Wiener 
(1948) and Wheeler, Little, and Lehner 
(1951) who stress what might be called the 
two-factor position. The former has classified 
the items of the MMPI clinical scales into 
the two categories of obvious and subtle, while 
the latter have concluded on the basis of a 
factor analysis of the MMPI scales that the 
test, “. . . permits diagnosis mainly in terms 
of ‘neurotic’ or ‘psychotic,’ but not in terms 
of type of neurosis or psychosis or other more 
specific category” (Wheeler et al., 1951, p. 
140). On the other hand, there are those like 
Harris and Lingoes (1955) and Fisher (1957) 
who believe that there are reliably more than 
two dimensions to this test (the multifactor 
position). 

It is the purpose of this study to test the 
relative merits of these two positions. Some 
considerations that might be advanced for the 
multifactor viewpoint are: (a) Since the origi- 
nal publication of the MMPI a very large 
number of special, cluster, factor, and ra- 
tiogally derived scales have been published. 


1The major part of the statistical analysis was 
carried out while the investigator was on the staff 
of the Langley Porter Neuropsychiatric Institute, 
San Francisco. Appreciation is expressed to the Uni- 
versity of California Computer Center at Berkeley 
for granting free use of the IBM 701 computer, and 
to the National Science Foundation for their sup- 
port of basic research using this computer. Further 
appreciation is expressed to the Michigan State 
University Computer Center for free access to the 
MISTIC computer to complete the analysis. Grateful 
acknowledgment is expressed to C. F. Wrigley, R. E. 
Harris, H. H. Anderson, L. L. McQuitty, C. Hanley, 
and G. H. Crook for their helpful suggestions and 
criticisms in the preparation of this paper. 
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Some of these new scales have proved helpful 
in both clinical and nonclinical settings, e.g., 
Drake’s Social I-E scale (1946), Feldman’s 
Prognosis for Shock scale (1951), Barron’s 
Ego Strength scale (1953), Gough’s Dissimu- 
lation scale (1954), and Harris’s Subscales 
(1955), among others. Yet, however useful 
some of these scales may be, it is improbable 
that there are as many dimensions of per- 
sonality or functioning as there are existing 
scales. It seems equally unlikely, on the other 
hand, that as many scales as this could have 
been developed in such a variety of settings, 
on quite heterogeneous groups, for ostensibly 
different purposes, and using such seemingly 
different criteria, and yet hold the view that 
the MMPI is capable of measuring only two 
basic factors. (4) Clinicians in practical set- 
tings seem to operate under the assumption, 
in writing their MMPI interpretations, that 
the test is useful in tapping attitudes and 
traits as well as in separating patients from 
normals and they attempt to rely upon a con- 
figural approach in doing this. (c) Previous 
factor analytic results may, have been due in 
part to the selection of factorially complex, 
composite scales which may have tended to 
obscure rather than clarify the factor struc- 
ture of this test. A hypothetical example of 
how such a situation might arise is the fol- 
lowing. Let us suppose that different factors 
were involved in the skills of multiplication, 
division, addition, and subtraction. Now it is 
conceivable, depending upon the tests in- 
cluded for analysis, that a different factor 
structure would result according to whether 
one used a total score based on these four 
tests or used subtest scores. One might in the 
former instance come out with a general 
quantitative factor and in the latter instance 
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with both a quantitative factor and common 
factors for each of the skills involved, since 
the variance of the separate tests would be al- 
lowed to vary independently from the total 
test score. 

To test the multidimensional hypothesis, 
the investigator felt that the independently 
developed rational subscales of Harris (1955) 
and of Wiener (1948), to be described below, 
would provide an excellent opportunity, since 
these subscales represent two different hy- 
pothe-«s about the MMPI structure. Both in- 
vestigators started with the standard MMPI 
clinical scales but classified the items in these 
scales differently. 


DESCRIPTION OF MMPI Supsca.es 
FOR ANALYSIS 


The Wiener subscales were derived by sort- 
ing items from each clinical scale into two 
groups. Those items which were relatively 
easy to detect as indicating emotional dis- 
turbance were called the Obvious items, while 
the remaining items were called Subtle. There 
were two main criteria: (a) how frequently 
the item was scored in the given direction by 
the normative samples and (/) subjective 
estimation of the implied emotional disturb- 
ance of the item. Items which were infre- 
quently answered by normals were automati- 
cally considered to be obvious, e.g.. all items 
appearing on the F scale. 

Since the Harris subscales have not yet 
been published, the rationale underlying their 
construction will be in more detail. 
These subscales were derived in conjunction 
with a study of prediction of response to psy- 
chotherapy (Harris & Christiansen, 1946). 
Differences on the Pd, Pa, Sc, and Ma stand- 
ard MMPI obtained between 
groups responding well and poorly to therapy. 
Neither group, however, was very high on 
any of these scales. Although one might have 
inferred something like “subclinical psychotic 
trends” in those responding poorly, there re- 
mained the possibility of achieving a more 
psychological definition of the differences by 
an inspection of differences on the items within 
these scales. Subscales were formed by group- 
ing together the items within each scale which 
scemed similar in content or which seemed to 


given 


scales were 


reflect a single attitude or trait, ie., item 
clusters were subjectively estimated. Upon 
scoring the two therapy groups on the vari- 
ous subscales, a number of differences emerged 
which led to an interpretation in terms of 
ego strength. Later research by Feldman 
(1951) and Barron (1953) on patients re- 
sponding well and poorly to various kinds 
of somatic and psychotherapy suggested that 
this interpretation was a sensible one. Fur- 
thermore, the clusterings that Harris made 
reveal a coincidence of overlap with Comrey’s 
analyses of the MMPI scales at the item 
level, notably on Hy (Comrey, 1957b) and 
on Pa (Comrey, 1958b), thus substantiating 
to a considerable extent the adequacy of 
Harris’s estimates. 

Eight of Wiener’s scales (HyO and HyS 
being omitted because of their near perfect 
correlations with the Admittance and Denial 
cluster scales of Little and Fisher (1952), 
which are to be subsequently analyzed), and 
28 of Harris’s subscales (1955) constituted 
the 36 variables of this study. The capacity 
of the IBM 701 program would allow for no 
more than this number; hence, some selection 
was necessary. This selection, however, in no 
way restricts testing the multidimensional hy- 
pothesis. Table 1, containing the various sub- 
scale names and descriptions, gives some idea 
of the content of these scales. 


HyYPoTHESES 


There would appear to be only four pos- 
sible outcomes to a factor analysis of the 
Harris and the Wiener subscales (if one ex- 
cludes the unifactorial position), i.e., (@) two 
major factors corresponding to the psychotic 
and neurotic factors of Wheeler et al. (1951): 
(b) one large factor with high loadings on the 
Obvious scales and another large factor with 
high loadings on the Subtle scales, which 
would support the two-factor view of Wiener 
(1948); (c) six factors, one for each of the 
six standard scales, with subscales derived 
from the same standard scale appearing on 
the same factor, e.g., Paranoia-Obvious, Para- 
noia-Subtle, and Harris’s Paranoia subscales 
of Pa,, Pas, and Pax, which would imply a 
scale homogeneity which the authors of the 
MMPI did not purposefully attempt to 
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achieve; and (d) a classification cutting standard scales, e.g., it is known that there 
across both the standard scales and the are two quite different kinds of items in the 
Wiener subscales, in conformity with the Hysteria scale and that they are negatively 
viewpoint of Harris and Lingoes (1955). Hy- correlated (Little & Fisher, 1952). The fourth 
pothesis 3 would not appear to be a serious hypothesis is favored by this investigator be- 
contender, since various investigators have cause of the considerations advanced at the 
demonstrated the factorial complexity of the beginning of this paper. 


TABLE 1 


SUBSCALE DESCRIPTIONS 


Scale Description Scale Description 

DO  Depression-Obvious Paz Poignancy: thinking of oneself as something 

DS _ Depression-Subtle special and different from other people; high- 

D, Subjective Depression : a negation of joy in doing strung; cherishing of sensitive feelings; overly 
things; pessimism, poor morale, and low self- subjective ; thin-skinned 


esteem ; complaints about psychological inertia 
and lack of energy for coping with problems 
D, Psychomotor Retardation: nonparticipation in 
social relations ; immobilization 
D; Complaints about Physical Malfunctioning: pre 
occupation with oneself 


Pa; Affirmation of Moral Virtue: excessive gener 
osity about the motives of others; righteous 
ness about ethical matters; oltuse naiveté; 
denial of distrust and hostility 

Scia Social Alienation: a feeling of lack of rapport 


dD, Mental Dullness: unresponsiveness; distrust of with other people; withdrawal from meaning: 
one’s own psychological functioning ful relationships 

Ds Brooding: ruminativeness; irritability Scis Emotional Alienation: a feeling of lack of rap 

Hy, Denial of Social Anxiety: social introversion port with oneself; experiencing the self as 

Hv, Need for Affection and Reinforcement from strange; flattening or distortion of affect; 
Others: implied in an (obtuse) denial of a , apathy 
critical or resentful attitude toward other | Sc., Lack of Ego Mastery, Cognitive: the admissior 
people ; impunitiveness; overly protested faith of autonomous thought processes, strange and 
and optimism in other people puzzling ideas 

Hy; Lassitude-Malaise: complaints about function- | s-., Lack of Ego Mastery, Conative: feelings of 


ing below par physically and mentally; effort 
ful keeping up of a good front; need for atten 
tion and reassurance 

Hy, Somatic Complaints: of a kind that suggest re- 
pression and conversion of affect 


“‘psychological weakness’; abulia, inertia, 
massive inhibition; regression 
Scxc Lack of Ego Mastery, Defect of Inhibition and 
Control: a feeling of not being in control of 
sare : one’s impulses, which may be experienced as 
Hys Inhibition of Aggression: expressed by concur lal A 
feelings; dissociation of affect 
PdS Psychopathic Deviate-Subtle Se; Sensorimotor Dissociation : a feeling ve! chang: 
ege 9 > tio . » hody 
Pd, Familial Discord: struggle against familial con = the pc Scepuon the self the vod) 
trol image; feelings of depersonalization and es 
Pd, Authority Conflict: resentment of societal de- trangement 
mands and conventions and parental standards | 4aO Hypomania-Obvious 
Pd; Social Imperturbability: denial of social anxiety; | AfaS Hypomania-Subtk 
blandness; denial of dependency needs Va, 
Pd, Social Alienation: feelings of isolation from other 
people; lack of belongingness; externalization 


Amorality: a callousness about one’s own mo 


| tives and ends and those of other people ; 


disarming frankness; denial of guilt 
of blame for difficulties ; lack of gratification in 
Ma, Psychomotor Acceleration: hyperactivity, labil 
ity; flight from ‘‘inner life’ and anxiety; 


Pdyp_ Self-alienation: lack of self-integration; avowal 

of guilt, exhibitionistically stated ; despondency 

PaO Paranoia-Obvious Ma, Imperturbability: affirmations of confidence in 

PaS_ Paranoia-Subtle social situations; denial of sensitivity; procla 

Pa, Ideas of External Influence: externalization of | mation of independence from the opinions of 
| 


pressure for action 


blame for one’s problems, frustrations, fail other people 
ures; in the extreme degree, persecutory ideas; 
projection of responsibility for negative feelings 


Ma, Ego Inflation: feelings of self-importance to the 
point of unrealistic grandiosity 
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MeEtTHOop 
Subjects 


The four samples of Ss used in this study con- 
sisted of 150 male and 100 female neuropsychiatric 
patients (58% were outpatients and 42% inpatients), 
and 100 male and 100 female community Ss. These 
four groups will be hereinafter referred to by the 
abbreviations: Pm, Pf, Cm, and Cf, respectively. 
Median ages and age ranges of these four groups 
9-65; 33, 14-63; 25, 19-47; and 29. 19 62, 
years, respectively. Patient Ss were randomly chosen 
from the Langley Porter Neuropsychiatric Institute 
MMPI files, the only criteria for selection being that 
not more than 30 items were left unanswered on the 
test and that the patient had been tested on. the 
group form. Community Ss were for the most part 
drawn from the staff at the same institution ona 
voluntary basis, while the remainder were drawn 
from used in another study (Lingoes, 1957) 
and from college students at the University of Cali- 
fornia at Berkeley, using the same criteria for in- 
clusion. No attempt was made to match these gr ups 
on various variables nor was there any attempt to 
make the groups homogeneous. The implications of 
this lack of matching and the heterogeneous nature 
of the samples will be discussed later. Suffice it to 
say here that the patient samples did not differ ap- 
preciably from each other in age or education. Both 
patient samples were in general older and of lower 
socioeconomic status than the community Ss 


are: 32, 


cases 


Procedure 


An independent factor analysis. employing Hotel- 
ling’s (1933) principal components method, was ap- 
plied to each set of correlations for the four groups 
of Ss. The squared multiple correlation (SMC) was 
used in the diagonal. This index is known to be 
the best universal lower bound for the communality 
(Guttman, 1940) and is calculated by taking each 
variable in turn as the dependent variable and com- 
puting the multiple correlation of that variable with 
the p-1 remaining variables and taking its square 
‘root. The SMC is thus seen to be a measure of pre 
dictable or observed common test variance. ie. 
variance common to the test itself and the remain 
ing p-1 tests. In an empirical investigation, using 15 
communality estimates, Wrigley (in press) has found 
SMCs to be one of the best estimates of communali 
ties in terms of stability, rate of convergence to, and 
discrepancy from, the true communalities. In dis- 
cussing the logical and practical issues of the com- 
mon-specific variance distinction in factor analysis 
and the problem of communalities, Wrigley (1957, p 
94) has offered a number of cogent arguments favor- 
ing the use of SMCs over that of communalitics, ez. 
“The problem of alternative sets of diagonal values 
satisfying the theory then disappears. SMC’s are 
probably less influenced by increasing the sample of 


? Gratitude is expressed to Alex Sheriffs for the use 
of his MMPI files. 


persons, since the revised values, though more ac- 
curate, may be either higher or lower. Di termining 
the diagonal entries becomes separate from deter- 
mining the number of significant factors. One disad- 
vantage is the lack of a significance test »” which 
has subsequently been provided by Tryon ® and used 
in this study to determine whether the factors ro- 
tated were significant. 

A principal axes solution yields as many factors as 
there are tests. Using SMCs in the leading diagonal 
instead of unities, however, gives rise to factors hav- 
ing negative latent roots. Only those factors with 
positive latent roots and with at least one signifi- 
cant loading, i., P less than .05, as determined by 
Tryon’s (1956) method were rotated. The method of 
rotation adopted was that of Kaiser (1958), called 
the Varimax method, an analytic solution approach- 
ing orthogonal simple structure; ie., the number of 
zero loadings for any particular factor are maxi- 
mized so that the factor variance is concentrated so 
far as possible in a few tests. The method operates 
on all possible pairs of factors and finds that angular 
transformation which maximizes the variance of the 
squared loadings, summed over the entire matrix. 
proceeding iteratively until convergence occurs. In 
order to preserve objectivity and make for com- 
parability, no attempt was made to improve upon 
the rotations by hand methods, although it was pos- 
sible that a better approximation to simple structure 
(using other criteria of Thurstone) 
have been achieved. 

Reproducibility of the factors was investigated by 
using the Burt-Tucker * (Burt, 1948, p. 185; Tucker, 
1951) index of factor similarity. To provide some 
psychological content for the factors, items from the 
scales defining the factors were submitted to 10 psy- 
chologists with MMPI test experience for inspection 
and written comments as to what they thought the 
factors might be measuring 


might thereby 


*One can determine the significance of any factor 
loading hj on any dimension or Factor C, for the & 
dimensions retained, by computing the standard error 
of hj; using the following formula: 


1—h? 


*" 


where /i,? is the communality of the variable in question 
and N is the sample size. Thus if the factor loading 
lies outside the range + o 1.96 h,,, then the population 
value could not be zero at the 95° 
fidence. 

*Burt and Tucker independently conceptualized 
one possible solution to the problem of factor simi- 
larity as involving an index of proportionality among 
the factor profiles. These indices range from +1.00 
to —1.00 and can be interpreted much as Ff in ac- 
counting for percentage of variance Accordingly, one 
must obtain an index greater than .707 to insure 
that one factor is more like another than it is un- 
like that factor. 
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RESULTS 


Number of Factors 


Fifteen factors accounted for 98°% of the 
common factor variance in the two patient 
groups and for 96% of the variance in the 
two community groups. The number of fac- 
tors extracted and the manner in which they 
cut across the classifications of Wiener, Har- 
ris, and Hathaway and McKinley would 
strongly suggest that the MMPI is indeed 
factorially complex. It took five factors in 
each of the patient groups and seven in each 
of the community groups to account for only 
80% of the variance (based on the unrotated 
factor matrices). The first factor in each of 
the analyses accounted for approximately 51, 
46, 37, and 39% of the observed variance in 
the respective unrotated matrices. These re- 
sults would seem to suggest the untenability 
of the two-factor position. 


Factor Replication 


Seven factors were reproduced, i.e., had in- 
dices of factor similarity of at least .70, in 
all four analyses, four factors were replicated 
in three, and three factors appeared in two. 
Patients were more like one another than 
they were like community Ss (12 factors in 
common), and likewise there was greater con- 
gruence among the community Ss (10 factors 
in common): the same thing might be said 
for the intrascx comparisons (there were 10 
factors in common within each sex group). 
Agreement in factors is greater in this study 
than heretofore reported in the MMPI litera- 
ture. This agreement was probably to a con- 
siderable extent a function of objectivity in 
computation, but to some degree it might also 
be partly spurious, since there exists the pos- 
sibility that the amount of item overlap not 
only increased the size of the correlations, 
but may have also introduced correlation 
among factors across groups. 


5 Tables 2-7, containing indices of factor similarity 
(Table 2), factor names and variances (Table 3), 
means and standard deviations (Table 4), scale in- 
tercorrelations (Table 5), unrotated factor loadings 
(Table 6), and Varimax rotated factor loadings 
(Table 7) for each of the four groups of Ss, have 
been deposited with the American Documentation In- 
stitute. Order Document No. 6119, remitting $1.75 
for microfilm or $2.50 for photocopies. 


Lingoes 


Order of Factors 


To facilitate presentation, the order of fac- 
tors was determined first by consideration of 
factor stability and second by the relative 
sizes of the factors. In accord with this or- 
dering, new factor numbers were assigned, 
e.g., Factor I (which differs from the first 
factor of any particular study by this no- 
tation). The size of each factor was deter- 
mined as follows: based on the sum of the 
squared factor loadings summed across groups 
(when replicated), the percentage of the total 
observed common variance of the four groups 
was obtained, e.g., the variance of the four 
groups (based on the SMCs) was 120.26, 
while the total variance of the four groups 
on the first factor in each analysis (based on 
the sum of the squared factor loadings) was 
30.51, and therefore Factor I accounted for 
approximately 25% of the variance. 

Although there were both interdiagnostic 
and intersex differences, these were found to 
be for the most part a function of the size of 
the various factor loadings rather than major 
differences in the relative magnitudes or pat 
terns of the loadings. Consistent differences 
across diagnostic groups and sexes will be 
mentioned when they occur, but only those 
factors will be presented which were repli- 
cated, based on the logic that greater confi- 
dence can be placed in them. Each of the 
factors will be related to the series of factor 
studies carried out by Comrey (1957a, 1957b, 
1958a, 1958b, 1958c) and Comrey and Marg- 
graff (1958) on the items of the MMPI clini- 
cal scales. Since there is an overlap between 
the items for Comrey’s factors and the items 
constituting the Harris subscales, the oppor- 
tunity for objectively but indirectly assessing 
similarity across factors and studies becomes 
possible. 


Manner of Presentation of Results 


Results are presented in the following or- 
der for factors obtained in all four analyses: 
(a) the factor’s number and name; (/) the 
most highly loaded subscales on the factor 
with their loadings for the four groups, i-e., 
Pm, Pf, Cm, and Cf; (c) any additional sub- 
scales which have loadings greater than .39 
for at least two analyses; (d) the indices of 
factor similarity; and (e) judges’ descriptions 
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of the factor based on (b) above. For factors 
which were replicated in at least three analy- 
ses, Items (a@)—(d) above are given with an 
indication of the groups included appearing 
alongside of the factor name. 


Factors Found in All Four Analyses 


Factor 1: General Maladiustment. There 
were seven scales with high loadings in all 
four groups: 


DO _Depression-Obvious (.88, .89, 
dD, Subjective Depre 
dD, Mental Dullness (.90, .90 

Ds Brooding (.85, .83, .73, .74 
Lassitude-Malaise (.85 00, 
Pda Self-Alienation 3 


SCon Lacl f | » Master 


ssion 9? 


Other subscales with loadings greater than 
.39 for at least two samples were: 


dD, Psychomotor Retardation 
D; Physical Malfunctior 

Il S ymatic Cor 
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lignancy 
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The Pm factor best exemplified Factor I. 
Indices of agreement with the remaining three 
groups were: .99, .95, .94. 

The 10 judges reviewing the 69 items from 
the seven with the highest loadings 
(overlapping items were offered only once) 
agreed in describing this factor as one of gen- 


scales 


eralized anxiety, depression, guilt, pessimism, 
and anxious worrying. It is much like the first 
factor found in 


and 


a number of other investiga- 
y appropriately be called the 
General Maladjustment Factor or as Kasse- 
baum, Couch, and Slater (1959) prefer to 
call it. a factor of Ego Ego 
Strength. This factor includes the majority 
of the items 1957h, 
for the D, Hy, and Pd 
scales and his second factor of the Sc scale 
(Comrey & Margeraff, 1958). 
Factor 11: Denial of Social Anxiety. Five 
scales had high loadings in all four groups: 


tions may 


Weakness vs. 


from Comrey’s (1957a 


1958c) first factors 


Hy, Denial of Social Anxiety (.82, .87, .81, .87 

PdS_ Psychopathic Deviate-Subtle (.69, .71, .70, .69 
Pd; Social Imperturbability (.73, .83, .83, .87 

WaS Hypomania-Subtle (.39, .57, .55, .38 

Va; Imperturbability (.62, .80, .63, .77 


No other subscales had loadings greater 
than .39 in any sample. This factor was also 
best represented by the Pm group. Indices of 
agreement with the other three groups were: 
.94, .95, and .93. 

The 47 items comprising the above five 
scales elicited such descriptions as:- social 
ease, interpersonal dominance, extraversion. 
and social adaptability (middle scorers) and 
veiled aggression, social obtuseness, denial of 
social anxiety, and naiveté (high scorers). 
Both Factors I and II had items referring to 
anxiety, but in contrast to General Maladjust- 
ment, the items in the present factor bear 
upon inter- rather than intrapersonal vari- 
ables. This factor includes the majority of 
items from Comrey’s (1957b, 19582, 1958c) 
second factor of the Hy scale, fourth factor 
of the Pd scale, and the first factor of the 
Ma scale. It is also of interest to note that 
this factor bears some resemblance to Kasse- 
baum et al.’s (1959) second factor of Intro- 
version-Extraversion. 

Factor III: Loss of Control. Four scales 
had high loadings: 


Lack of Ego Mastery 


Control (.55, .71, .52 


Sez t of Inhibition and 


Sensorimotor Dissociatior 


MaO Hypomania Obvious 43 


Maz Psychomotor Acceleration 


Only one other subscale had loadings greater 
than .39 in at least two samples, i.e., DS De 
pression-Subtle (—.20, —.57, —.18, —.40). 
The Pm factor showed the highest indices of 
agreement with the other groups, i.e., .92, .88, 
and .93. 

The consensus classified the 45 items from 
the first four scales into the areas of: loss of 
emotional and impulse control, restlessness, 
incipient ego disintegration, and possible body 
image distortion. This factor would appear to 
be measuring at the high end, the failure of 
neurotic defenses, and in the middle ranges, 
inadequate control mechanisms. The highest 
factor loadings tended to appear in the two 
female groups, which might indicate a higher 
saturation of this factor among females. There 
was little congruence between these subscales 
and Comrey’s (1958c) 
Marggraff's (1958) 
Sc scales. 
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Factor IV: Denial of Distrust and Hostil- 
ity. High loadings (greater than .39) were 
obtained on three variables: 


Hy: Need for Affection and Reinforcement from Others 
(.63, .59, .78, .58) 

PaS Paranoia-Subtle (.90, .87, .93, .91) 

Pas Moral Virtue (.84, .92, .93, .89) 


Indices of .96, .95, and .95 were obtained 
between the Cf factor and factors in the re- 
maining groups. Some indirect support for 
this factor can be adduced from Comrey’s 
(1957b, 1958b) analyses of the Hy and Pa 
scales, inasmuch as his third factors from both 
of these scales overlap in items with the Hy. 
and Pa; subscales. 

Descriptions such as: naiveté, denial of dis- 
trust and hostility, dependence upon the good 
will of others, impunitiveness, tolerance, re- 
pressive and submissive behavior, were ob- 
tained from the 25 items constituting the 
above three scales. Both the present factor 
and Factor II involve repression, denial, and 
naiveté, but whereas the latter deals with in- 
teraction ease, the present factor was more 
specifically concerned with defenses against 
both hostile and aggressive impulses. 

Factor V: Familial Discord. The following 
two scales had the highest loadings: 


Pd, Familial Discord (.76, .74, .68, .72) 
Sceia Social Alienation (.46, .41, .50, .37 


One other scale had loadings greater than 
39, ie., PdS Psychopathic Deviate-Subtle 
(.26, .40, .14, .43). The Pf group had indices 
of .94, .84, and .94 with the three remaining 
groups. 

This factor was described as measuring es- 
sentially intrafamilial conflict, unresolved de- 
pendency strivings, feelings of being mis- 
trusted, misunderstood, deprived, and _iso- 
lated. There were 31 items appearing in the 
first two scales. Some agreement between the 
items of Pd, and Comrey’s (1958c) 12th fac- 
tor on the Pd scale was evident, although 
Comrey and Marggraff’s (1958) analysis of 
the Sc scale split the Sc, items among two 
factors. 

Factor VI: Inhibition and Apathy. Two 
variables had high loadings on this factor: 


DS Depression-Subtle (.51, .43, .69, .50) 
Dz Psychomotor Retardation (.46, .49, .60, .71) 
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No other subscales had loadings greater 
than .39 for at least two samples. The commu- 
nity S groups had higher loadings on this fac- 
tor than the patients. The Cf factor yielded 
indices of .86, .83, and .85 with the other 
three factors. 

The 28 items from these two scales were 
described in the following terms: inhibition, 
overcontrol, rigidity, obstinacy, feelings of 
apathy and inertia, emotional constriction, 
etc. D. items were distributed on two of 
Comrey’s (1957a) D factors. Although only 
two variables had sufficiently high loadings to 
define this factor, it is nonetheless a common 
factor according to the factor model employed 
in this study, i.e., using SMCs. However, one 
might prefer to call such factors as these near 
specifics (Wrigley, 1957). The psychological 
or statistical significance of such factors is, 
however, an issue apart from the common- 
specific distinction and such factors will be 
included if replicated. pending their further 
investigation. 

Factor VII: Social Nonconformity. Only 
one scale had consistently high loadings in all 
four groups: 


Pd, Authority Conflict (.69, 41, .71, .71 


The second highest loadings for the two 
patient groups appeared on Ma, Amorality, 
with loadings of .56 and .77 for males and fe- 
males, respectively. In the community groups, 
there was no consistency. Since the Pm factor 
yielded the highest agreement indices with the 
other factors, i.e., .81, .81, and .82, and since 
the patients had a second scale comparable in 
loadings to the first (Pd.), it was decided to 
consider this a patient factor, defined by both 
Pd». and May. 

General agreement was evident in the com- 
ments of the judges. e.g., rebelliousness, argu- 
mentativeness, immaturity, hostility directed 
specifically against authority, potential for 
acting out, nonconformity, assertion of inde- 
pendence, and egocentricity. Only 16 items 
from two scales represented this factor. Pd: 
items did not cluster together in Comrey’s 
(1958c) analysis of the Pd items. 


Factors Found in Three Analyses 


Factor VIII: Hostility-Alienation-Projection 
(Pm, Pf, and Cf). Two scales had high load- 


= 
= 
i 
agg 
é 
A 
t 
a 


Factors of MMPI Subscales 81 


ing in all three samples: 


Pd Social Alienation (.47, .53, .66) 
Pa, Ideas of External Influence (.75, .89, .86) 


Additional subscales with loadings greater 
than .39 were: 


PdO Psychopathic Deviate-Obvious (.33, .47, .44) 
PaO Paranoia-Obvious (.72, .&3, .19) 
Paz Poignancy (.41, .42, .14) 


Scia Social Alienation (.51, .50, .31 


The Pi factor had indices of similarity of 
.95 and .84 with the other two factors. There 
were 29 items in the first two scales. This 
would appear to be primarily a patient factor. 

Factor 1X: Somatic Complaints (Pm, Pf, 
and Cm). There were only two scales with 
loadings greater than .39 in at least two 
samples: 

Hy, Somatic Complaints 
Scz Sensorimotor Dissociation 


66, .74, .65 
46, 50, .78 
The Pm factor yielded indices of .89 and 
.71 with the other two factors. This too is 
probably a patient factor. There were 33 
items in these two scales. 
Factor X: Poignancy (Pf, Cm, and Cf). 
Only one scale had consistently high loadings: 


Paz Poignancy (.67, .81, .73 


The Cm factor had indices of .80 and .89 
with the two female factors. In the commu- 
nity groups, PaO had the next highest load- 
ings (.59 and .71, respectively), while in the 
patient groups, this scale was associated with 
Pa,, which appeared on Factor VIII. There 
were only 9 items in the above scale and this 
factor was considered to be a near specific. 

Factor XI: Poor Health (Pm, Cm, and 
Cf). The three groups had one scale in com- 
mon with high loadings: 


D, Physical Malfunctioning 


63, .58, .48 


The Cm factor had an index of .83 with the 
Cf factor, but only a borderline index of .70 
with the Pm factor. There were 11 items in 
this scale. In addition to Dz, the community 
samples had loadings of .61 and .46 on Hys 
Lassitude-Malaise, and consequently this fac- 
tor was considered to be a nonpatient factor. 


Factors Found in Two Analyses 


Three factors which accounted for 64% of 
the variance collectively appeared too small 


and doubtiul to deserve anything more than 
a mention. Factor XII had high loadings on 
MaO and Ma, for the community groups; 
Factor XIII had high loadings on MaS for 
the Pm and Cf groups; and Factor XIV had 
high loadings on Ma, for the two patient 
groups. 


DISCUSSION 


The present study of six standard MMPI 
scales has demonstrated a dimensionalization 
which is inconsistent with the two-factor posi- 
tions of both Wiener (1948) and Wheeler 
et al. (1951). Instead, the MMPI appears to 
have: (a) a complexity not coextensive with 
the presently constituted six scales, and (>) 
a greater simplicity than might be inferred 
from the 28 titles given to the Harris sub- 
scales (1955). Implications of the present 
findings will be discussed under the following 
topic headings: (a) the multidimensional po- 
tentials of the MMPI, and (4) the “obvious- 
subtle” distinction. 


The Multidimensional Potentials of the 
MMPI 


The term potentials has been chosen ad- 
visedly, since a test does not per se possess 
dimensionality, but only in relation to a speci- 
fied population or set of operations which pro- 
duce the test responses. The results of this 
paper would lead one to conclude that the 
MMPI can reliably measure considerably 
more than two dimensions, e.g., neuroticism 
or psychoticism. 

Of the 15 factors extracted there are seven 
which have emerged from four analyses (ac- 
counting for approximately 64° of the total 
observed variance). There are a further four 
derived from three analyses, accounting for 
an additional 19° of the variance. Admit- 
tedly, the number of factors obtained is 
partly a function of the number of variables 
included and may be to some extent an arti- 
fact resulting from item overlap. Neverthe- 
less, to conclude that because the first two 
or three factors account for roughly half the 
variance that the test is just measuring this 
or that aspect of personality, e.g., sickness or 
general maladjustment, or is influenced pri- 
marily by some response set, is not justified 
on the present evidence. 
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Despite he increased confidence in the fac- 
tors resulting from replication in a number of 
heterogeneous samples, the logical possibility 
is here raised of committing what might be 
called the identity error, i.e., inferring identi- 
cal factors from identical covariance patterns. 
It may well turn out that different groups re- 
spond in like manners to the same items for 
entirely different reasons, either because the 
items have different meanings to different in- 
dividuals or that the items themselves are 
complex factorially and have differing validi- 
ties. Scales are being developed based on the 
replicated factors to test this hypothesis on 
various groups of Ss. 


The “Obvious-Subtle” Distinction 


An inspection of the rotated Varimax load- 
ings on the eight Wiener scales revealed the 
O scales were scattered over at least three 
factors in all four analyses. In general, the 
same findings applied to the S scales. Further- 
more, as in the case of the PaO items, this 
scale was associated with different factors, 
i.e., With Pa, in the patient samples and with 
Paz in the community samples. If the Ob- 
vious and Subtle scales from Hysteria had 
been added, HyO would have probably ap- 
peared on Factors I, IX, and XI, while HyS 
would probably have come out on Factors IT 
and IV. Whatever else the Obvious-Subtle 
items might be measuring, Wiener’s conclu- 
sions, based on the intercorrelations among 
these scales and other variables, do not fully 
explain the above results, ie., “They (the 
Obvious and Subtle scales) seem to have the 
same meaning and to perform the same func- 
tions for each scale” (Wiener, 1948, p. 169).* 
Wiener’s contention would seem to be on 
firmer ground in respect to the O scales, since 
they do yield similar sign patterns across a 
large number of factors (they. are positively 
interrelated). It would appear, however, that 
an analysis of his scales in the context of the 
Harris subscales makes previously considered 
specific variance now common. This addi- 
tional source of common variance is, more- 
over, greater than that attributable to obvi- 
ousness per se, i.e., content does seem to make 
an important difference. 


® Italics and parentheses present author's. 
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It would appear from the present evidence 
that obviousness and subileness may have 
different as well as common meanings and 
functions within each scale and that the dis- 
tinction Wiener has made, while useful in in- 
vestigating response sets, must be accordingly 
modified to admit of many kinds of obvious 
and subtle items. This conclusion would ap- 
pear to be supported by that of Hanley 
(1959) in respect to his investigation of ac- 
quiescence. 


Conclusions 


It can be said that this study has demon- 
strated the multidimensional potential of the 
MMPI and has lent indirect support to the 
configural approach to interpretation ad- 
vanced by the authors of this test. Although 
item overlap may have introduced extra 
sources of variance, thus inflating the SMCs, 
it is felt that the subscale approach used here 
is a valuable one for clarifying factor struc- 
tures. Because only six of the clinical scales 
were analyzed, the inclusion of other MMPI 
scales might have modified, clarified, or in- 
troduced factors to undetermined de- 
One might expect, however, that Pt 
would have appeared on Factor I and that 
Hs would probably have split on Factors III, 
IX, and XI had they been incltided. 

It is believed that this study complements 
that of Comrey to the extent that some ob- 
jective evidence can be adduced that the fac- 
tors he obtained within scales are similar from 
scale to scale. This evidence is, of course, in- 
direct, since while there is congruence be- 
tween some of his factors and the Harris sub- 
scales, the correspondence is far from perfect. 
Factor scales to be developed based on this 
analysis are intended to supplement and not 
to replace the standard MMPI scales, al- 
though they might well replace the Harris 
subscales as an interpretative aid. 


some 


gree. 


SUMMARY 


This investigation was concerned with test- 
ing the merits of the two-factor versus the 
multifactor hypothesis of the MMPI as rep- 
resented by Wiener and by Harris, respec- 
tively. Harris classified MMPI items under 
28 categories, while Wiener chose only 2, i.e., 
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Obvious and Subtle, as an adequate descrip- 
tion of the test’s complexity. 

From the 36 subscales analyzed (represent- 
ing six clinical scales) in four samples of sub- 
jects (patients and nonpatients, males and 
females), seven factors were replicated in 
four analyses and four in three. The 9 of 
these 11 factors which could be readily identi- 
fied were: General Maladjustment (1), De- 
nial of Social Anxiety (II), Loss of Control 
(III), Denial of Distrust and Hostility (IV), 
Familial Discord (V), Inhibition and Apathy 
(VI), Social Nonconformity (VII), Hostility- 
Alienation-Projection (VIII), and 
Complaints (IX). 

It was concluded on the basis of the analy- 
ses that: (a) reliably more than two dimen- 
sions can be demonstrated; (5) the “obvious- 
subtle” distinction, while having some perti- 
nence for the study of response sets, cannot 
fully explain the present factor results and 
that one must admit of many kinds of obvi- 
ousness and subtleness; (c) while the present 


Somatic 


study is far from definitive, the fruitfulness 
of performing analyses on the basis of sub- 
scales rather than composite scales is sug- 
gested; and (d) evidence was offered relating 
the series of factor studies carried out by 


Comrey at the item level to the scale analysis 

of this study. 
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Recently Levitt (1957) published a survey 
of the studies concerning the effectiveness of 
psychotherapy with children on the model of 
Eysenck’s (1952) now classic enquiry with 
adults. Levitt concludes that the published 
figures do not support the contention that 
psychotherapy with children is more effective 
than nonintervention. His baseline data are 
derived from two studies: that by Witman 
and Keller (1942) and that by Lehrman et al. 
(1949). Averaging these studies shows that 
untreated controls have an improvement rate 
of 72.5%, whereas a survey of 18 studies 
of the outcome of psychotherapy with chil- 
dren shows an average improvement rate of 
67.05%. More strictly comparable with the 
baseline data studies is the averaging by 
Levitt of 17 follow-up studies of the effects 
of psychotherapy on children. These yield a 
figure which shows that 78.22% of the cases 
were improved. 

Levitt’s article is a challenge to all workers 
in the field of child psychotherapy, indicating 
as it does the relative ineffectiveness of this 
field of endeavor. Levitt is careful to point 
out that it is mot proven that psychotherapy 
is ineffective; but the converse, that psycho- 
therapy is effective, is also not proven. 

The present study was undertaken as an 
attempt to isolate from the published litera- 
ture some of the factors contributing to suc- 
cess or failure in psychotherapy as rated at 
the close of treatment. Follow-up studies were 
excluded since criteria for follow-up success 
were not detailed in any definable way. 


THE Sources OF UNCERTAINTY 


The problem of establishing a baseline. At 
this juncture the writer wishes to note the 
difficulty of arriving at baseline data for the 
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evaluation of the effectiveness of psychother- 
apy. The method used in the two studies 
(Lehrman et al., 1949; Witman & Keller, 
1942) accepted by Levitt has serious draw- 
backs. These studies used as controls cases 
who had made an approach to the clinic and 
had been put on the waiting list but who, for 
one reason or another, did not accept the 
offer of treatment when it was later made 
available. Levitt says that such cases were 
in all respects similar to treated cases, except 
for the fact that they received no treatment. 
While the published literature contains no 
studies where the controls have been ob- 
tained by any other method (so that no bet- 
ter statistics are available as a basis for com- 
parison), it is, however, not entirely a valid 
method of obtaining controls. 

For one thing, too little is known about the 
factors intervening between first clinical con- 
tact and later acceptance for treatment, and, 
for another, it is common experience at all 
clinics that many cases on the waiting list 
clear up before treatment can be given. Too 
little is known about the dynamics of this, 
but at least two factors are known to account 
for it: (@) cases where the approach to the 
clinic is made in a temporary crisis, which 
subsides while the case is pending, and (6) 
cases where a healing dynamic process is ini- 
tiated by the parents’ recognition of their need 
for help, so that by the time help is available 
it is no longer needed. 

As all these factors are operative for both 
the treated and untreated groups, is it not 
possible that the treated groups represent the 
more serious, intractible problems, while at 
least a proportion of the untreated groups are 
minor or transient problems? 

However, in the absence of any better sta- 
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tistics, these must be accepted, albeit with 
reservations, whose very nature demands that 
conclusions drawn from them should be highly 
tentative. 

Comparison of the studies showing high 
success with those reporting low success. The 
four studies reporting the highest percentage 
of success (Cohen & Davis, 1934; Irgens, 
1936; Newell, 1943; Witmer & others, 1933) 
were compared with the four studies report- 
ing the lowest (Brown, 1947; Christianson, 
Gales, & Coleman, 1934; Cunningham, West- 
erman, & Fischhoff, 1956; Maas, Kahn, 
Stein, & Sumner, 1955). This brought to 
light several features that were blurred in 
Levitt’s (1957) treatment of the data. 

The influence of the date of the study. 
Most unexpectedly, it was found that the 
cases treated in clinics in the high success 
studies all date from 1931-1932 despite the 
divergence of dates of publication; whereas 
three of the four low success studies date 
from after 1945. The one exception among 
the low success studies (Christianson et al., 
1934), which also dates from 1931-1932, is 
differentiated from the other studies, which 
all deal with major clinics in major cities, by 
being concerned with the northern New Jer- 
sey clinics—i.e., the possibility must not be 
ignored that this study may reflect a lower 
level of service than the others. 

The entire 18 studies were then arranged 
into two categories, dichotomized at the av- 
erage level of success (67%), with the fol- 
lowing findings as regards dates: 


Highs (over 67% 


Author Percentage Date 


1936 
1934 
1943 
1933 
1929 
1931 
1951 
1936 
1952 
1939 


Irgens 

Cohen and Davis 
Newell 

Witmer & others 

Lee and Kenworthy 
Burlingham 

Barbour 

Hubbard and Adams 
Reid and Hagan 


Carpenter 


One of the studies included by Levitt that re- 
ported the second lowest percentage improvement 
(Canaday, 1940) was excluded from this study as 
it is an unpublished thesis and therefore inaccessible 
to study at this distance. The four “lows” in this 
study are, accordingly, four of the five lowest in- 
cluded by Levitt. 


Lows (under 67%) 


Author Date 


Percentage 


La More 66.0 
65.3 


65.0 


1941 
1949 
1938 


Lehrman et al. 
Albright and Gambrell 


Christianson, Gales, & 
Coleman 1934 

Cunningham, Westerman, & 
Fischhoff 


Brown 


1956 
1947 
Canaday 1940 
Maas, Kahn, Stein, & 

Sumner 1955 

From immediate inspection it can be seen 
that, of the Highs, 7 of the 10 are stud- 
ies prior to 1940. Of the studies dated later 
than 1940, one (Barbour, 1951) is a British 
study and not strictly comparable with the 
other United States studies; another (Newell, 
1943), though published in 1943, is con- 
cerned with data gathered 10 years earlier, 
in 1933; and a third (Reid & Hagan, 1952) 
is related to residential treatment while all 
the others are concerned with nonresidential 
treatment. 

Among the Lows, on the other hand, only 
two of the eight studies are dated 1940 or 
earlier. One of these (Christianson et al., 
1934) has been discussed above and is not 
regarded as strictly comparable with the 
others in this group. The other (Canaday, 
1940) has not been available for detailed 
study, for reasons given above. 

Consequently, if the Barbour (1951) and 
the Reid and Hagan (1952) studies are 
omitted from the Highs for the reasons given, 
and the Christianson et al. (1934) study 
from the Lows, one finds that all the Highs 
date from before 1939, and six out of seven 
of the Lows date from after 1940. 

The problem of length and intensity of 
treatment. It was remarkable how little in- 
formation could be obtained about the im- 
portant questions: (2) How much treatment 
did each case receive, and (6) Was the per- 
son concerned with the treatment of the 
child a qualified psychotherapist? * This is 

2 The writer is aware that the term “psychothera- 


pist” is open to various divergent interpretations. It 
is here being used to refer to anyone having spe- 
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particularly curious in view of the vast body 
of information that was reported anent pre- 
senting symptoms, age, sex, socioeconomic 
status, parental attitude, and other back- 
ground variables. Only one study that was 
reviewed (Maas et al., 1955) included a 
breakdown of the success—failure percentages 
in terms of length of treatment. This study, 
which reports the lowest over-all improve- 
ment rate (only 43.1°7), shows that success- 
ful outcome is directly related to length of 
treatment. 


Percentage 
No. of Interviews N Improved 
None or single family contact 137 7.2 
Brief service (2-9 interviews) 77 221 
Short-term treatment (10-39 
interviews) 14 53.8 
Long-term treatment (40 or 
more interviews) 144 80.2 


By contrast one finds that in the four stud- 
ies reporting the highest level of success, the 
actual amount of treatment each child re- 
ceived was mostly extremely slight. Thus 
Irgens (1936) gives details of the treatment 
for 48 out of the 70 cases comprising the 
study. Of these, only four had more than 10 
interviews, and these were with an unspeci- 
fied worker. Her reports on treatment are 
characteristically along these lines: “Two in- 
terviews, summer camp, school adjustment.” 
Yet this study has the highest success level 
(85.7%). Compare this with Newell (1943) 
who reports 79.2% success for a procedure 
designed specifically for use in a school sys- 
tem and dealing exclusively with difficulties 
arising in school, where the great preponder- 
ance of the cases (percentage unspecified) are 
seen once only, treatment being concentrated 
on advice to teachers about handling, and 
secondarily to parents. This is in sharp con- 
trast with the later studies, notably the previ- 
ously cited one by Maas et al. (1955), where 
the majority of cases (57.4%) had over 10 
interviews each. 

Related to the problem of length of therapy 
is, of course, that of the qualifications of the 


cialized training in child psychotherapy, irrespective 
of theoretical approach, and including psychiatrists, 
clinical psychologists, and child analysts, but exclud- 
ing social workers, teachers, etc. 
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therapist. Once again there is a dearth of in- 
formation, but in the only study which does 
specify the type of skills of the therapist, the 
resulting information is interesting. Witmer 
and students (1933), who report an overall 
improvement rate of 79.0%, also broke down 
the figures to indicate whether the therapist 
was a psychiatrist or a social worker. Her 
findings are not as dramatic as Maas’, show- 
ing percentages varying from 73 to 82 for 
both psychiatrist and social worker for less 
than 15 interviews; but at over 15 interviews 
the psychiatrist achieves 97% success (N = 
30), and the social worker 84% (N = 208). 

The problems of the inclusion of unsuitable 
cases. It was noted that, where information 
was included about presenting symptoms, in 
every instance there were included small 
numbers of cases that are presumably either 
incurable (e.g., mongoloidism) or curable by 
somatic medicine (e.g., endocrine dysfunction- 
ing). Though the numbers are never large 
(the percentages are never in excess of 10% 
of all cases), their exclusion slightly improves 
the percentages for successful treatment. 


DISCUSSION 


In a sense, Levitt (1957) has put the con- 
temporary child guidance clinic to the test 
and found it wanting. But is this in fact 
so? Two main arguments suggest it is not. 
First, there are serious inadequacies (detailed 
above) in the data relating to the recovery 
rate of untreated children which render them 
suspect as a basis for comparison with treated 
cases. Secondly, the studies straddle a period 
in which child guidance clinics have not only 
been growing and modifying their practices 
but also adjusting to rapid social changes in 
the community as a whole, so that all other 
considerations apart (such as lack of uni- 
formity of procedures in individual child 
guidance clinics), it is unsatisfactory to ar- 
rive at conclusions based on the amalgama- 
tion of these studies, for amalgamation treats 
them all as equal; whereas, as has just been 
suggested, it is more probable that a close 
examination will reveal that child guidance 
practice in, say, 1933, had little in common 
with that obtaining in 1950 or 1955. How- 
ever, his study does show that present-day 
child guidance clinic practices are not highly 
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effective in dealing with children’s psycho- 
logical malfunctionings, though it does ap- 
pear that they were considerably more effec- 
tive in the early 1930's than they are now. 
Why should this be so? 

No definite and final answer can be given 
here, but is it not possible (from the intéernal 
evidence of the various studies reviewed) that 
during the past three decades child guidance 
clinics have undergone a variety of changes 
both in the nature of the service rendered to 
the community and, even more importantly, 
in the severity of the disturbances they at- 
tempt to deal with? 

It is suggested that during the past two 
decades, child guidance clinics have increas- 
ingly been dealing with deep-seated cases of 
disturbances requiring extensive psychother- 
apy and less with relatively minor cases that 
can be treated by manipulation of the envi- 
ronment.* Probably there is a multiplicity of 
reasons for this change, including increasing 
public awareness of severe childhood disturb- 
ance, increasing availability of trained thera- 
pists, possibly also an increased actual inci- 
dence of more severe disturbance, reflecting 
the social dislocation in the past two decades, 
and possibly many other factors as well. 


It is the writer’s contention that the ques- 
tion “How effective is psychotherapy with 
children” is still unanswered. 


SUMMARY 


In an attempt to throw further light on 
Levitt’s (1957) findings on the apparent in- 
effectiveness of psychotherapy with children, 
the unsatisfactoriness of the baseline data 
was discussed. The literature was re-reviewed, 
and the method of extreme groups was ap- 
plied. The four studies reporting the highest 
percentages of success were contrasted with 
the four reporting the lowest. It was found 
that: 

1. The studies reporting high percentages 
of success relate to cases treated during the 
years 1931-1932, whereas three of four stud- 


% This is borne out to some extent by the fact that 
early studies make no mention of psychotic chil- 
dren, while later ones do. Also, where treatment is 
specified, earlier studies rely heavily on environmen- 
tal manipulation, while later studies report increas- 
ing numbers of interviews with the patient himself. 


ies reporting low percentages are postwar 
studies. This relationship was found to hold 
good when a total of 18 studies were di- 
chotomized at the average percentage. 

2. There was little agreement among the 
studies regarding factors responsible for higher 
successes, except that length of treatment was 
related positively to successful outcome, as 

yas also the nature of the skills of the thera- 
pist. 

3. The wide divergences in criteria for suc- 
cess, and methods of treatment, render the 
averaging of a large number of studies an un- 
satisfactory method of evaluating the effec- 
tiveness of psychotherapy with children. 

It is contended that, as yet, there is no 
satisfactory answer to the question of how 
effective psychotherapy is with children. 
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REPLY TO HOOD-WILLIAMS 


EUGENE E. LEVITT 
Indiana University Medical Center 


The commentary by Hood-Williams (1960) 
casts some interesting and thought provoking 
light on my review of the results of psycho- 
therapy with children (Levitt, 1957b). Hood- 
Williams’ contentions are certainly worthy of 
discussion, but they hardly constitute a head- 
ing so encompassing as “revaluation,” as I 
shall attempt to demonstrate. 

The influence of the date of the study. lf 
one accepts Hood-Williams’ analysis, its effect 
is to emphasize the inference that treatment 
has been ineffective. We must allow that the 
work since 1945 is more representative of 
what we now call psychotherapy than work 
done previous to that time. Hood-Williams 
suggests, as an explanation for the date fac- 
tor, that clinics are now dealing with more 
serious problems than they used to. Possibly. 
It appears more likely, though, that the lower 
recent improvement rate is a function of ad- 
vances in diagnostic methods, greater knowl- 
edge of personality dynamics, and increased 
research sophistication. 

On the other hand, one should not hasten 
to accept Hood-Williams’ analysis. In my 
Table 5 (Levitt, 1957b, p. 194), I pointed to 
a potential positive relationship between out- 
come and number of points used in the out- 
come rating scale. All of the reports using 
4- and 5-point scales show above average im- 
provement rates, and all but one were car- 
ried out prior to 1936. Both of the reports 
using dichotomous scales have below aver- 
age improvement rates; they were reported in 
1955. Thus there are two possible causal fac- 
tors here. 

The problem of establishing a baseline. I 
qualified the comparability of treated cases 
and so-called “defector” controls with the 
phrase that they were similar “so far as is 
known” (Levitt, 1957b, p. 190). I chose what 
appeared to me to be the best kind of con- 


trol, and Hood-Williams is in error when he 
says that there have been no other methods 
published. At least five control techniques 
have been employed in the past (cf. Levitt, 
Beiser, & Robertson, 1959). 

The evidence accumulated since the publi- 
cation of my 1957 paper suggests that the 
idea that the treated cases are serious dis- 
turbances, while the defectors are transient 
disorders, is also erroneous. One study (Levitt, 
1957a) shows that treated and defector child 
cases do not differ on some 61 factors, in- 
cluding two clinical estimates of severity of 
disturbance, and eight other factors rela- 
ing to symptoms. A second study (Levitt, 
1958a) reported that expert judges found no 
difference in severity of symptoms between 
treated and defector child cases, judging from 
case records. The mean severity ratings, based 
on a five-point scale, were 2.98 for treated 
cases and 3.02 for defectors. 

Follow-up interviews with parents of 142 
defector cases (Levitt, 1958b) disclosed that 
only 13% attributed defection to improve- 
ment in the child’s symptoms. Over half 
blamed either the clinic or environmental cir- 
cumstances. In an earlier telephone follow-up 
of 40 cases (Inman, 1956) 14, or 35°, gave 
improvement as a reason for defection. How- 
ever, all but three of these parents saw fit to 
volunteer at least one other reason as well, 
which hints at some defensiveness about" the 
improvement explanation. 

Putting the two studies together, 18% of 
the parents gave improvement in the child as 
a reason for defection. This percentage does 
not seem large enough to warrant the state- 
ment that pretherapy remission distinguishes 
treated cases and defectors, though it may 
have affected the intergroup comparison to 
some extent. However, we have no independ- 
ent evidence to suggest that the same propor- 
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tion would not hold for the treated cases as 
well. A parent might well feel that the child 
had improved and yet not see fit to break 
contact with the clinic. 

I do not claim that any of these studies or 
lines of reasoning is conclusive, but taken to- 
gether, their weight seems opposed to the hy- 
pothesis that the defector cases are less dis- 
turbed as a group than cases who go on to 
receive treatment. 

The problem of length and intensity of 
treatment. It was not my intention to deal 
with these factors in my review, primarily be- 
cause the needed information was lacking in 
most reports, as Hood-Williams notes. The 
problem is certainly noteworthy, especially 
since it is by no means as simple to grapple 
with as it might appear at first glance. Con- 
sider Maas, Kahn, Stein, & Sumner’s (1955) 
table presented by Hood-Williams. If we ac- 
cept these data as representative, and com- 
pare them with the defector results, the in- 
ference is not simply that “successful outcome 
is directly related to length of treatment.” 
The data show that improvement is a func- 
tion of at least 40 treatment interviews: a 
lesser number actually makes the patient 
worse. Would the practicing psychotherapist 
care to support the contention that 35 treat- 
ment interviews is worse than no treatment 
at all? Is there an hypothesis that could ex- 
plain this peculiar phenomenon? And why is 
it that only 7% of the Maas’ sample having 
no contact, or only one family contact, with 
the clinic showed improvement, while the fig- 
ure is 10 times as great for approximately the 
same degree of contact among defectors? 

Attempts to determine the relationship be- 
tween outcome at close of treatment and 
number of interviews are confounded by a 
time factor. Twenty interviews usually span 
a longer time period than five interviews. 
Should a positive relationship appear, we 
cannot be certain that it is not due simply 
to the passage of time alone, quite apart from 
number of interviews. This point is exempli- 
fied by Table 4 in my review (Levitt, 1957b, 
p. 193) and by the Denker (1946) study of 
untreated adults. Both show increases in im- 
provement rate with time, without reference 
to intensity of treatment. 


I agree entirely with Hood-Williams that 
the matters of intensity and type of treat- 
ment, and the qualifications of therapists are 
potentially significant, and should be thor- 
oughly discussed in any evaluation study. (In 
the review, I referred to the “generally poor 
caliber of methodology and analysis” in the 
studies covered therein.) The fact that this 
kind of information is usually lacking sug- 
gests that one should be circumspect in draw- 
ing inferences from the review. It does not 
impeach the findings. 

The character of the evaluated treatment is 
taken up in some detail in the evaluation 
follow-up study at the Institute for Juvenile 
Research in Chicago (Levitt et al., 1959). 
We report that almost half of the cases were 
treated by students or staff members with 
one year of experience or less. Only one-third 
were seen by therapists with more than three 
years of experience. A third had to switch 
therapists at least once during the course of 
therapy, and some had been handled by three 
or even four therapists. Ninety per cent of 
the individuals were treated on a once-a-week 
basis, and only 15% by psychiatrists. 

If similar conditions of therapy have been 
prevalent among child guidance clinics, then 
Hood-Williams’ point might have greater im- 
port. But we do not know, and probably will 
never know. In fact, we do not even know for 
certain that experienced therapists will be 
more effective, though it seems to be a plau- 
sible hypothesis. 

Hood-Williams has raised some reasonable 
points. If his paper serves to focus additional 
attention on the need for continuing evalua- 
tion of our therapeutic efforts, and on the 
necessity for tightening up methodology and 
technique, then its publication is amply justi- 
fied. But his conclusions concerning the re- 
view of results are too sweeping and incau- 
tious for the limited and equivocal analyses 
which he carried out. 
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DEVIANT RESPONSES IN COLLEGE ADJUSTMENT CLIENTS: 
A TEST OF BERG’S DEVIATION HYPOTHESIS! 


AUSTIN E. GRIGG ano JOSEPH S. THORPE 


University of Texas 


Berg, in a series of provocative papers 
(1955, 1957, 1958), has insisted that indi- 
viduals when taking almost any type of non- 
achievement test make many responses which 
are similar to responses made by most others, 
but they also make responses which are un- 
like those of others in the general population 
and are more like responses made by mem- 
bers of some special subpopulation to which 
the particular individual belongs. Thus, some 
individuals respond to test items of various 
kinds in a manner more typical of lawyers 
than of engineers or of the general popula- 
tion. Some make responses more typical of 
psychotics than of psychopaths or of the gen- 
eral population. 

Berg believes that the particular test con- 
tent is not important, but that the use of 
clearly delineated criterion groups and an 
analysis of response tendencies peculiar to 
the particular group are the basic essentials 
in using deviant responses to study person- 
ality. He hypothesizes that the tendency to 
give deviant responses is a generalized one 
and is exhibited in situations other than the 
testing situation. In regard to personality mal- 
adjustment, he postulates, in effect, that those 
who give maladaptive responses in everyday 
living will give “deviant” responses when 
scored against a key of typical responses for 
most any kind of psychological test. 

Using Berg's approach, Barnes (1956) 
found that a simple count of the total num- 
ber of atypical responses on the MMPI cor- 
related .87 with Scale 7 and .93 with Scale 8. 
In an earlier study, this same author (1955) 
was able to discriminate between a large 


1 This research was supported by funds awarded 
to the University of Texas Testing and Counseling 
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sample of controls and psychiatric patients 
on the basis of how the subjects responded 
when checking personal preferences for a 
small number of abstract geometrical designs. 
Barnes also studied clinical subgroups by the 
deviant response method. 

Although Berg and his students are cur- 
rently championing the deviant response ap- 
proach, it should be remembered that as 
early as 1910, Kent and Rosanoff used a 
“deviation” approach when they altered the 
Jungian word association technique and ap- 
praised results in terms of common and un- 
common associations to stimulus words 
diagnostic of personality maladjustment. 

Following Berg’s reasoning, if one takes a 
normal sample of college students and studies 
the common and uncommon responses they 
make when asked to describe themselves on 
an adjective check list and then constructs an 
adjective list from the most commonly and 
the most rarely selected words, one would pre- 
dict that with a new population those who be- 
come therapy clients would check more of 
the rare words and fewer of the common 
words than would the nonclient members of 
the population. The present study tests the 
validity of this prediction. 


as 
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METHOD 


In February 1958, during their regular freshmen 
testing sessions, the entering freshmen checked their 
self-descriptions on the 300-item Gough Adjective 
Check List (1955). An item analysis was made of 
the responses by the 181 males and 92 females to 
ascertain the most commonly checked adjectives and 
the least commonly selected adjectives. The original 
analysis was made separately for each sex, but it 
was observed that certain “sex-biased” adjectives 
could be eliminated from the data and the results 
would be applicable to both sexes.. (The word 
“sweet,” e.g., was commonly selected by girls but 
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rarely by boys.) Seventy-two items of commonly 
checked and uncommonly checked adjectives were 
selected for the revised form of the test by includ- 
ing all adjectives selected by 84% or more of the 
freshmen and also all adjectives selected by fewer 
than 16% of the group. The final selection of 72 
adjectives consisted of 33 commonly selected words 
and 39 uncommonly selected words. 

In September 1958, this revised 72-item adjective 
check list was given to approximately 1,400 enter- 
ing freshmen. The freshmen were asked to check the 
adjectives that best described them. At the close of 
the academic year this pool of data was divided into 
those freshmen who had reported during the year 
for psychiatric help at the Student Health Center or 
to the University of Texas Testing and Counseling 
Center for personal adjustment counseling, those 
who had reported for other types of counseling such 
as vocational-educational counseling, and into those 
who did not seek any type of professional counsel- 
ing or psychiatric aid. In the present study, the 
counseling cases were classified as to type after each 
interview by the counselor, and those who remained 
as educational-vocational did not receive assignment 
to personality maladjustment by their counselors. A 
study was made of the “deviant” responses of the 
psychiatric treatment group, the two counseling 
groups, and a randomly selected contro] group from 
the nonclient group. A deviant response was defined 
as each time a typical adjective was not checked 
(from the 33 commonly selected adjectives) or each 
time an atypical adjective was checked (from the 
39 uncommonly selected adjectives). The four groups 
studied were: 24 psychiatric treatment cases, 37 per- 
sonal counseling cases, 186 educational-vocational 
cases, and 150 randomly selected nonclient controls 


RESULTS AND DISCUSSION 


The means and standard deviations of the 
deviant response scores for each of the four 
groups are presented in Table 1. The psy- 
chiatric and personal counseling groups gave 
more deviant responses than did the educa- 
tional-vocational counseling group or the con- 
trol group. 

The summary of an analysis of variance of 


TABLE 1 


MEANS AND STANDARD Deviations oF Deviant RE- 
SPONSES OF PsycHIaATRIC, PERSONAL COUN 
SELING, VOCATIONAL COUNSELING, 

AND CONTROL GROUPS 


Group Mean 


14.29 
12.08 
9.98 


Psychiatric 
Personal 
Vocational 
Control 


TABLE 2 


ANALYSIS OF VARIANCE OF DEVIANT RESPONSE SCORES 
OF PSYCHIATRIC, PERSONAL COUNSELING, VOCA 
TIONAL COUNSELING, AND NONCLIENT 
CONTROL GROUPS 


Mean 

Source of Variation Square 
Between groups 

Psychiatric & Personal 

vs. 

Vocational & Control (1) 

Psychiatric vs. Personal (1) 

Vocational vs. Control (1) 
Within groups 393 


186.57 


(484.66) 

(71.14) 

(4.92) 
47.72 


Total 396 


* Significant at the .01 level of confidence 


the deviant responses is given in Table 2. 
The between-group means F test, which 
yielded an F value of 3.91, significant at the 
.0O1 level, was followed by three orthogonal 
comparisons between the various groups. The 
first of these comparisons indicates that the 
psychiatric group and the personal counsel- 
ing group, when combined, differ significantly 
from the vocational group and the control 
group taken together, when compared for av- 
erage number of deviant responses. The two 
groups with personality problems give sig- 
nificantly more deviant responses, a finding 
consistent with Berg’s hypothesis. The re- 
maining comparisons show that the psychi- 
atric group does not differ significantly from 
the personal counseling group in deviant re- 
sponse scores and that the vocational group 
does not differ significantly from the control 
group in average number of deviant responses. 

The differences in the data are consistent 
with Berg’s Deviation Hypothesis. The two 
groups who sought professional help for per- 
sonal adjustment problems, the psychiatric 
group and the personal counseling group, pre- 
sumably exhibited more personality disturb- 
ance than did members of the nonclient con- 
trol group or members of the educational- 
vocational counseling group. In the present 
study, although most of the deviant adjectives 
are of the type that would be selected by sub- 
jects with a response set to describe them- 
selves in self-depreciating terms, only 39% of 
the adjectives in the present study occur in 
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Gough’s list of “least favorable” adjectives, 
and only one of the 30 items of Gough’s Ad- 
justment Scale from his Adjective Check List 
(1955) occurs in the total pool of commonly 
and uncommonly selected adjectives of the 
present study. 

We call attention to a novel element in 
our study. The traditional manner of deriv- 
ing an empirical discrimination between two 
groups has been to give a test to two criterion 
groups, e.g., well-adjusted and maladjusted, 
select the discriminating items by an item 
analysis, and then to cross-validate the em- 
pirical key with new groups. In the present 
study, modeled to test Berg’s hypothesis, the 
empirical test key was established by study- 
ing a general population in terms of typical 
and atypical responses. Then various criterion 
groups drawn from a new population were 
used to ascertain the validity of the deviant 
response key. 


CONCLUSION 


A list of commonly selected and uncom- 
monly selected adjectives from the Gough 
Adjective Check List was established on the 
basis of responses of a group of university 
freshmen. When this list of frequently chosen 
and infrequently chosen words was given to 


a new sample of freshmen, it was found that 
the deviant response scores of freshmen who 
later became personal counseling or psychi- 
atric treatment cases were significantly greater 
than the deviant response scores of a control 
group of nonclients. A group of freshmen who 
later applied for educational-vocational coun- 
seling and were retained in that classification 
by their counselors did not deviate signifi- 
cantly from the control group in their deviant 
response scores. These results are consistent 
with Berg’s Deviation Hypothesis. 
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BRIEF REPORTS 


INDIVIDUAL AND SOCIAL CORRELATES OF PRISON ESCAPES’ 


D. A. DOBBINS,? F. E. STOCKWELL 
Louisiana Department of Institutions 
anp W. S. LOVING 


Louisiana State Penitentiary 


Many prison administrators believe prison es- 
capes are primarily situational, based on op- 
portunity and triggered by stresses of the prison 
environment. The fact remains that some in- 
nates will escape and others will not even when 
opportunities are equal. This study attempts to 
identify some personal and social characteristics 
of the “escape risk” inmate and, in so doing, 
help fill a curious scarcity of empirical research 
on the subject. 

The Ss were 200 white male inmates of the 
Louisiana State Penitentiary. Half had success- 
fully escaped over a two-year period. The re- 
maining 100, matched by month and year of ad- 
mission and degree of custody, were nonescapees. 
Each escapee was thus matched with a non- 
escapee imprisoned for the same length of time 
and under the same degree of supervision at the 
time of escape. 

Twenty-two variables were subjected to chi 
Eight variables 
the criterion groups above the . 
Traits representative of 


discriminated 
5 confidence level. 

groups were 
fewer dependents. more prior out of 
state penal commitments 


square analysis. 


as follows: 
more property crimes 
committed, more juvenile commitments, younger 


1An extended report of this study may be ob- 
tained without charge from F. E. Stockwell, Louisi- 
ana Department of Institutions, Baton Rouge Louisi- 
ana, or for a fee from the American Documentation 
Institute. Order Document No. 6114, remitting $1.25 
for microfilm or $1.25 for photocopies 

2Now with the Personnel Research Branch, 
TAGO, Washington, D. C. 


at first arrest, from smaller cities, from other 
states, and fewer years residence within Louisi- 
ana. 

Three additional variables were significant be- 
tween the .05 and .10 levels. Traits predominant 
in the escape group were: longer sentences, lower 
tested educational achievement, and younger at 
admission. 

Variables which did not discriminate between 
he two groups were: Otis IQ, marital status, 
yroken homes, urban backgrounds, father’s occu- 
pation, number of siblings, previous prison com- 
mitments within state, veteran status, psychiatric 
history, years school completed, and number of 
institutionalized siblings. 

Significant variables were intercorrelated and 
cluster analyzed using Tryon’s modification oi 
Holzinger and Harman’s B coefficient. The first 
cluster, with a B coefficient of 2.41, included the 
following variables: years residence within Louisi- 
ana, distance to home state, number penitentiary 
commitments elsewhere, and size of community 
This cluster was labeled “Transient Criminality.” 

The remaining four variables 
within one additional cluster with a B coefficient 
of 2.64. The cluster, identified as “Early Crimi- 
nal History,” included number of juvenile com- 
mitments, age at first arrest, number dependents, 
and commission of property crimes. 

The results suggest that with situational factors 
held reasonably constant, the risk of a given 
prisoner’s escaping is related to certain precom- 
mitment life patterns. 
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Brief Reports 


A DIMENSIONAL ANALYSIS OF CLINICAL JUDGMENT ! 


JOHN W. DONAHOEF 


University of Kentucky 


Commonly, studies of clinical judgment se- 
verely restrict the mode of response open to the 
judge (e.g., psychiatric diagnosis). This is done 
in order to yield a readily quantifiable dependent 
variable (e.g., number of correct classifications). 
Unfortunately, such decisions constitute an un- 
- representative sample of the kinds of judgments 
that clinicians are called upon to make. Rather, 
what appears to be needed is an instrument allow- 
ing for more diversified responses so as to provide 
a knowledge of the dimensions along which 
clinical judgments vary. The present study was 
an attempt to develop a rating scale to fulfill 
this function. 

The rating consisted of 50 descriptive and 25 
dynamic seven-step scales. The descriptive 
scales were chosen from adjectives and phrases 
used by Beck in the interpretation of test ma- 
terials. Terms of opposed meanings (e.g., con- 
stricted—spontaneous) were selected as scale end- 
points. The 25 dynamic scales dealt with the 
chief sources of conflict and the mechanisms of 
defense according to psychoanalytic theory (e.g., 
never uses projection—very frequently uses projec- 
tion). 

The Rorschach productions of six hospitalized 
mental patients were scored and summarized 
The diagnostic categories represented were schizo- 
phrenia, depressive reaction, personality trait dis- 
turbance, and chronic brain syndrome. Twelve 


1 An extended report of this study may be obtained 
without charge from John W. Donahoe, Psychology 
Department, University of Kentucky, Lexington, 
Kentucky, or for a fee from the American Docu- 
mentation Institute. Order Document No. 6115, 
remitting $1.25 for microfilm or $1.25 for photo- 
copies. 

2 This study was conducted while the author was 
employed at the Veterans Administration Hospital, 
Lexington, Kentucky. 


clinical psychologists with an average of 3.1 years 
of diagnostic experience served as judges. All 
judges were successively presented with the six 
protocols and asked to place a check mark on 
each of the 75 scales and also to diagnose the 
patient in accord with conventional psychiatric 
nomenclature. 

The median interjudge reliability coefficient was 
51, with a range from .18 to .98. In order to 
examine the structure of the rating form, a scale 
matrix was constructed. Those scales with 
squared vector lengths of less than 5.0 were not 
included, thereby reducing the number of de- 
scriptive scales from 50 to 42 and the number of 
dynamic scales from 25 to 6. The reduced 
matrix was analyzed by the D method of factor- 
ing. The first factor was defined by the scale end- 
points much hostile conflict (pivot), neurotic, 
unstable, insecure, and anxious. These are terms 
intimately associated with the concept of neurot- 
icism. The second factor dealt with a continuum 
from introversion to extroversion ‘as the scales 
having highest coordinates were cautious (pivot), 
introversive, constricted, and withdrawn. The 
third factor, while not as homogeneous as the 
others, was concerned with psychosis. 
heavily loaded on the third factor were organic 
brain involvement (pivot), defenses holding 
poorly, psychotic, and underreactive. 

The judges were correct in only 38% of their 
diagnoses. When the matrix was analyzed with 
respect to Rorshach protocols, however, the rec- 
ords were appropriately grouped. Thus, while the 
judges were capable of checking the rating form 
with some validity, they were unable to integrate 
these separate judgments into an accurate global 
statement. In short, the scaling technique should 
provide a means for assessing clinical judgment 
under a wide variety of conditions. 


Scales 


(Received June 12, 1959) 
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SOME CORRELATES OF MANIFEST ANXIETY IN CHiLDREN '! 


IRA ISCOE ann IRENE COCHRAN 


University of Texas 


Two hypotheses were tested. 1. If a re- 
lationship exists between anxiety as measured by 
the CMAS and observable behavior, then children 
who score high on the CMAS should be judged 
as significantly more “maladjusted” than their 
classmates of the same sex who score low on 
this scale. 2. If Manifest Anxiety is related 
to adjustment. it should bear a significant re- 
lationship to the California Test of Personality 
a frequently used instrument to assess “adjust- 
ment” in school age children. 

For Hypothesis 1, 118 boys and 96 girls, aver- 
age age 10.4 years, from six elementary school 
classrooms, were administered the CMAS. For 
each grade the five boys and five girls having 
the highest and lowest anxiety scores were se- 
lected. A total of 60 Ss was thus obtained. 
Mean HA was 25.5 and LA was 3.4. Homeroom 
teachers rated each S on “adjustment” compared 
to own sex group, using a five-point rating scale. 
They were unaware of CMAS scores of the stu- 
dents. y? was equal to 8.2 (df=2) which is 
significant at the .02 level. A rank order correla- 
tion yielded a rho of .37, significant at .01 level. 
Manifest Anxiety would appear to be an ob- 
servable characteristic, noticeable to those fa- 
miliar with the Ss’ everyday school behavior 
Since the concept of Manifest Anxiety is de- 
rived from Hullian theory, a consideration of the 


1 An extended report of this study may be ob- 
tained without charge from Ira Iscoe, Department 
of Psychology, University of Texas, Austin 12, Texas, 
or for a fee from the American Documentation Insti- 
Order Document No. 6118, remitting $1.25 for 
microfilm or $1.25 for photocopies 


tute 


relationship between high and low drive and 
teachers’ judgments seems appropriate. Ques- 
tioning of teachers as to the basis of their judg- 
ments revealed a tendency to equate “maladjust- 
ment” with “restlessness,” lack of attention, and 
inability or unwillingness to “settle down.” Good 
adjustment indicated the opposite tendencies. 
There is some support, therefore, for the belief 
that drive level was a prominent observable 
characteristic. 

For Hypothesis 2, 228 fourth- and sixth-grade 
children were administered the California Test 
of Personality and the CMAS. High scores on 
the former indicate good adjustment 

CMAS correlated — .48 with “Total Adjust- 
ment,” — .49 with “Personal Adjustment,” and 
— .59 with “Freedom from Nervous Symptoms.” 
These r’s are highly significant (p = .01). 

The results of Hypothesis 1 suggest a relation- 
ship between general body activity and judgments 
of adjustment. This is conmmensurate with the 
concept of Manifest Anxiety as a measure of 
drive level. The results of Hypothesis 2 suggest 
a significant relationship between high drive level 
and adjustment and also the willingness to admit 
certain subjectively experienced discomforts and 
apprehensions. The possibility of similarity of 
item:content on both instruments should also be 
considered. 

Despite certain shortcomings. the concept of 
Manifest Anxiety in children as a measure of 
drive level, if not of adjustment. has in it 
potentialities that deserve further study. 


(Received August 9, 1959) 
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A CLUSTER ANALYSIS OF OBJECT SORTING BEHAVIOR * 


A. B. SILVERSTEIN 
Pa ific State Hospital 


Clinical psychologists are frequently called 
upon to assess a patient’s ability to think ab- 
stractly, and we possess a number of instruments 
with which we endeavor to measure this ability 
and gauge its impairment. Not much is known, 
however, concerning the relationships among the 
measures which these instruments yield. The 
aim of the present study was to clarify the inter- 
relations of certain scorable aspects of the Gold- 


stein-Scheerer object sorting test as used by 
Rapaport, Gill, and Schafer (1945). 
The data were those reported by Rapaport 


et al. for 54 randomly selected members of the 
Kansas Highway Patrol. Thirteen variables were 
selected for analysis. All distributions were 
dichotomized as close to the median as possible, 
and ¢/@m,. was employed as the index of associ- 


ation. McQuitty’s elementary linkage analysis 
was then utilized to treat the data. 
The analysis yielded three clusters. two of 


which correspond closely to Part I (active con- 
cept formation) and Part II (passive concept 
formation) of the object sorting test. One ex- 
ception was that Concrete Definitions from Part 
II of the test appeared in the cluster with the 
active concept-formation variables. The second 


1 An extended report of this study may be obtained 
without charge from Arthur B. Silverstein, Pacific 
State Hospital, Pomona, California, or for a fee 
from the American Documentation Institute. Order 
Document No. 6117, remitting $1.25 for microfilm 
or $1.25 for photocopies. 

2 Formerly at the Psychiatric Institute, University 
of Maryland. 

3 The author is indebted to D. Rapaport and to 
the Year Book Publishers for permission to make 
use of material appearing in Diagnostic Psychological 
Testing. 


exception was that Functional Definitions from 
both parts of the test formed a cluster of their 
own, 

Rapaport et al. consistently distinguish between 
active and passive concept formation and place 
some emphasis on the diagnostic significance of 
differences in performance on Parts I and II of 
the test. The findings of this study indicate that 
the distinction between the two parts of the test 
is quite justified. Not all research workers make 
this distinction, however. A comparison of per- 
formance on Parts I and II as scored by the 
“conceptual area” analysis of McGaughran and 
Moran (1956) would appear highly desirable in 
the light of the present findings. 

It has been suggested that more valid estimates 
Wechsler’s vocabulary 
might be obtained if functional definitions were 
to receive less credit than concrete definitions 
since the order abstract-concrete-functional ap 
pears to be closer to the truth than the conven 
tional order abstract-functional-concrete. In con- 
trast to this, the results of the present analysis 
suggest that while abstract and concrete defini- 
tions may lie on a continuum, functional defini- 
tions actually represent a thing apart. 


of intelligence. e.g.. on 
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PSYCHOLOGICAL TEST REVIEWS 


New Tests 


Stott, D. H., and Sykes, Miss E. G. Bristol Social 
Adjustment Guides. Age 5-15.1 Adjustment guide 
(observation blank), scoring templates, and diag- 
nostic form (summary record blank) for (a) the 
child in school (boy) BG-1, and (girl) BG-2; 
(b) the child in residential care BG-3; and (c) 
the child in the family BG-4. Guides BG-1 and 
BG-2 use the same scoring templates and diag- 
nostic form. Guide BG-—4 includes a life history 
blank and is scored using a copy of the guide 
with overprinted key in licu of templates. 

The Bristol Social Adjustment Guides constitute 
a careful attempt to render more objective and pre- 
cise the observations of teachers, social workers, and 
foster parents, with respect to children’s social 
adjustment. The Guides are intended to save time 
for the psychologist by providing accounts of be- 
havior more standardized and compact than anec- 
dotal protocols, and more specific than impression- 
istic ratings. Since the Guides encourage accuracy 
in observing children and provide a system for 
classifying symptomatic behavior, they have a place 
also as aids in clinical training 

Each Guide consists of numerous adjectives and 
phrases grouped by topic and scrambled as to de- 
sirability, though with negative indications in the 
majority. The observer simply underlines those terms 
which apply to the child concerned. Translucent 
templates are provided on which critical items are 
keyed for maladjustment in specific areas such as 
“hostility to adults,” “anxiety,” “knavery,” and 
“psychosomatic symptoms.” The results can be 
quasi-quantified on the “diagnostic form” on which 
the keyed items are recorded, by counting the sig- 
nificant items marked in each category. Items in 
certain areas are also grouped as to severity. How- 
ever, the author eschews any overall index of mal- 
adjustment. 

Stott and Sykes are thoroughgoing empiricists, 
and their work is a good example of an inductive 
approach to conceptualizing maladjustment Avoid- 
ing the constructs of current theories, they collected 
descriptions of specific children as reported in case 
conferences. By successive tryouts with teachers, 
social workers, etc., they developed lists of descrip- 
tive terms which could be used comfortably and 
unambiguously by nonpsychologists. Items were 
grouped into approximately 15 “attitudes,” eg. 
“depression” or “hostility to adults,” by a scalogram 
like procedure in which children’s names on one 
axis of a large grid and their symptoms on the 

1 With manual, The Social Adjustment of Children, 
pp. 46. London: University of London Press, 1958. 


other were ordered and recorded 
syndromes of behavior emerged. Only those items 
are now scored whose relative frequencies among 
the stable vs. the maladjusted (omitting a middle 
category of “unsettled children’) yielded significant 
x?’s. However, the “attitude” syndromes shade into 
one another along a behavioral spectrum and have 
merely descriptive significance 

Since the Adjustment Guides are intended only to 
elicit and organize the specifics of observed behavior, 
not all the usual criteria of test evaluation are rele- 
vant. The 46-page manual by includes a 
lengthy defense of his scientific philosophy and a 
detailed historical account of the 
in the development of the Guides 
(interrater agreement) in a preliminary study is 
given as around .76. More data are needed, 
reliability is likely to depend as much upon the 
raters’ skills and opportunities as upon the cr 
of these forms. The validity of each in dif- 
ferentiating between the stable and the maladjusted is 
presented, but without cross-validation Normative 
data for the interpretation of total scores on each 
attitude are lacking. This is not a serious omission 
for American users who will need to develop their 
own norms anyway 

Differences between the English and American 
languages constitute a more fundamental problem 
The author’s success in finding meaningful descriptive 
phrases drawn from the colloquial speech of nurses 
teachers, etc. eg. “spivvish dress,” or “feckless,” 
will occasionally create 
for Americans. 


until coherent 


Stott 


successive steps 
Their reliability 


but 


ntent 


tem 


problems of understanding 
Even clinical vocabulary has its dif- 
ferences—e.z., “unforthcomingness” for “shy,” or 
“unsettled” for “mildly maladjusted.” Those of us 
used to the euphemistic polysyllables of “psycho- 
pathic deviate” may feel decidedly 
characterizing a child as a “knave.” The Guides, 
jn summary, will probably not achieve their po- 
tential value in this country until someone undertakes 
to translate them from the autl English 
ours —Read D. Tuddenham 


self-conscious 


into 


Carrol, J. B. & Sapon, S. M 
Aptitude Test. Ages 14—adult. 2 forms, long 
(60 min.) and short (30 min.). Hand or IBM 
scoring. Tape recording necessary for administra 
tion of long form ($7.50) ; test yklets ($3.5 
25), IBM answer sheets 50), 
stencils ($0.50 per set), specimen set 
tape) ($0.75), manual, 27 pp. New York 
logical Corp., 1959. 

The Modern Language Aptitude Test, the outcome 
of a five year study, includes aural and visual ma- 


Modern Language 


) for 
scoring 
(without 
Psych 


99 


if 
| 
ow 
a 
sf 
=, 
» 
4 
pict 
4 
3 
¥ 
a 
; 
|_| 
5 
A 
: 


100 


terials. The long form which contains both types 
of materials requires a tape recorded for administra- 
tion. The two aural parts of the test, included only 
in the long form, are “Number Learning” and 
“Phonetic Script.” The other three are: “Spelling 
clues,” requring recognition of incompletely spelled 
words; “Words in Sentences,” testing sensitivity to 
grammatical structure; and “Paired Associates,” in- 
tending to tap memory. 

Sufficient data bearing on reliability and validity 
are presented to warrant issuing the test and to 
establish confidence in its usefulness as a diagnostic 
device. The normative data, particularly for high 
school and adult norms, are weak.—E. S. B. 


Stark, R. H. Polyfactorial Study of Personality. 
Adults. Test booklets ($6.25 per package), IBM 
answer sheets ($2.75 per package), stencils ($2.00), 
profile sheets ($2.75) specimens set ($2.50), manual, 
11 pp. ($0.75). New Rochelle, N. Y.: Bruce, 
1959. 

A 300-item inventory reportedly selected on the 
basis of factor analysis and correlations with un- 
specified criteria. The general impression is that the 
major criteria may have been other tests, e.g., Ror- 
schach, MMPI, and the H-T-P test. Ten scores 
are obtained and Tables of Correlations with MMPI, 
the AsSociation Adjustment Inventory, and certain 
subtests of the WAIS are presented. Test-retest re- 
liability for a prison population suggests that the 
scores are reasonably stable. No other validation 
data are offered, even though reference is made to 
various possible criterion populations. There are no 
references to published reports. Why was this test 
issued after so little developmental analysis ?— 
B. 


Cassell, R. N. Test of Social Insight. Two forms, 
youth edition (junior and senior high school) and 
adult edition. Untimed. Test booklet ($6.25 per 
package) ; IBM answer sheets ($2.75 per package) ; 
specimen sets ($2.50); Profile sheet ($2.75 per 
package); IBM scoring stencils ($1.00), manual, 
30 pp. ($1.75). New Rochelle, N. Y.: Bruce, 
1959. 

The Test of Social Insight consists of 60 five 
alternative multiple choice items dealing with home 
and family relations, authority figures and social 
agencies, play and avocational interests, and work 
and vocational interests. The two editions are 
parallel in content, only slightly altered to adapt to 
the different age groups. In addition to a total 
score, five part scores: withdrawal, passivity, co- 
operativeness, competitiveness, and aggressiveness, 


Psychological Test Reviews 


are obtained. There are two sets of norms for the 
youth edition (junior and senior high school stu- 
dents) and two sets for the adult edition (college 
students and adults and guidance counselors and 
personnel psychologists). 

The odd-even reliabilities, even after Spearman- 
Brown corrections, are unsatisfactory, none above 
88 and, in some populations, as low as .51. Validi- 
ties based on inadequately reported unpublished data 
are more reassuring. Caveat Emptor!—E. S. B. 


Bruce, M. M. Bruce Vocabularly Inventory. Adult. 
Untimed (20 min.); Specimen sets ($1.00); Test 
booklet ($5.00 per package); IBM answer sheet 
($2.75 per package); manual, 3 pp. ($0.75); 
scoring keys, hand sets ($2.00), and machine sets 
($4.50). New Rochelle, N. Y.: Bruce, 1959 
A 100-item multiple choice test in which the key 

words were chosen at random from the even num- 

bered pages of the Oxford Universal Dictionary. 

Norms are given for eight not very precisely defined 

occupational groups and for the aggregate. The 

groups vary from 61 to 495 in size. While the re- 
ported reliabilities, odd-even, .92, and _ test-retest, 

84, are acceptable, the test correlates only slightly 

more (approximately .80) with other vocabulary 

tests than with intelligence tests (approximately .60). 

—E. S. B 


Bruce, M. M. Association Adjustment Inventory. 
Adult. Untimed (10 min.); Test booklet ($5.00 
per package) ; IBM answer sheets ($2.75 per pack- 
age); scoring keys, hand set ($2.00) and machine 
set ($4.50); manual, 15 pp. ($0.75). New 
Rochelle, N. Y.: Bruce, 1959 
The Kent-Rosanoff list of stimulus words has been 

cast up into a four alternative multiple choice 

format. Thirteen different scores are obtained, in- 
cluding a total (maladjustment) and an immaturity 
score, and 11 dimensions of “deviate ideation.” 

Tables of intercorrelation raise serious doubts as to 

whether there are that many independent scores 

Norms for “institutionalized” and “general” popula- 

tions are based on large but vaguely defined popula- 

tions. Test-retest correlations for one week and 
one month intervals are satisfactory, but the base for 
claiming validity is very unsatisfactory. There is 
evidence of some general relationships with MMPI 
scales and with a largely unvalidated inventory. 
Reference is made to unreported results of diagnostic 


validity studies that are unpublished. It should 
have been marked “For Experimental Use.”— 
E. S. B. 
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SCORING HUMAN MOTIVES 
John Dollard and Frank Auld, Jr. 


Two noted psychologists present a method for studying the psycho- 
therapeutic process, designed to measure growth, change, and progres- 
sion within the patient and the evolution of the relationship between 
the patient and the therapist. This manual gives instructions for coding 
each “unit” (usually a sentence) of a psychotherapeutic interview, 
ascribing to each unit the conscious or unconscious motivation behind it. 


Sound recordings from actual cases have been transcribed, enabling the 
reader to trace for himself the series of steps from raw data through 
scoring. Hundreds of illustrations of the scoring are given in the in- 
structions, and there are two complete interviews for practice scoring. 


This remarkable new book makes accessible for scientific analysis what 
was once a matter of untested opinion. 


$9.50 


PERSONALITY 


AND PERSUASIBILITY 


Edited by Carl I. Hovland and Irving L. Janis 


This investigation of responsiveness to persuasive communications de- 
velops a new concept, “general persuasibility”—the individual's suscep- 
tibility to social influences irrespective of the persuader or what he ad- 
vocates. The tests used by its ten contributors assess the characteristics 
of resistant individuals, gullible ones, and those with discriminating re- 
sponse, analyzing them in terms of personality factors, social attitudes, 
sex, and intelligence. 


$5.00 


Yale University Press 


New Haven Connecticut 
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THE STANFORD-BINET 
INTELLIGENCE SCALE: 1959 Revision 


Lewis M. Terman and Maud Merrill 


January 1960 


COUNSELING FOR PERSONAL ADJUSTMENT 
IN SCHOOLS AND COLLEGES 


Fred McKinney, University of Missouri 
584 pages 1958 $6.50 


HOUGHTON MIFFLIN COMPANY °* Boston 
New York Atlanta Geneva Dallas Palo Alto 
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TIMELY McGRAW- HILL BOOKS 
PRINCIPLES OF COMPARATIVE PSYCHOLOGY 


By H. Wartas, University of Florida; Donorny A. University 
of Florida; and E. The George Washington University. McGraw- 
Hill Series in Psychology. 453 pages, $7.95 


This book represents the combined efforts of a group of distingwished psychologists working in 
the field of animal study. The purpose is to present a broad coverage of the field of animal 
behavior—to acquaint the reader with the methods and results of comparative studies and the 
principles to be drawn from them. Emphasis is placed a current experimental and theoreti- 
cal literature, without neglecting the standard material to be expected: in such a survey. 


QUANTITATIVE METHODS IN PSYCHOLOGY 


By Dow Lewrs, Siate (Jniversity of Iowa. McGraw-Hill Series in Psychology. 558 
pages, $9.50 


A text for the increasingly important “Quantitative Methods” “Quantitative Analysis” psy- 
chology courses. It is concerned with the application of hepatetiationl mathematical and graphi- 
cal techniques to the clarification and interpretation of experimental data. The aim of the book 
is to provide a comprehensive survey of the quantitative and statistical procedures hasic to the 
utilization of mathematical functions in describing empirical relationships and in developing 
theoretical scl ema for behavior. 


EXPERIMENTS IN HEARING 


By Geozc von Bexesy, Harvard University. McGraw-Hill Series in Psychology. 744 
pages, $25.00 


An cminent international psychologist and physicist makes available to pbhv-‘ologists and psy- 
chologists the results of his unique ground-breaking experiments in the fie\ x hearing. Over 
the years, von Bekesy has created his own experiments and has established and 
neurological facts concerning hearing which have advanced this field considerably. The book 
is based on most of the papers published by the awthor during the last 30 years. 


STATISTICAL ANALYSIS IN PSYCHOLOGY AND 
EDUCATION 


By Georce A. Fercuson, McGill University. McGraw-Hill Series in Psychology. 354 
pages, $7.00 


A textbook introducing the ideas and applications of statistics for students of the behavioral sci- 
ences—education, psychology, psychiatry, and sociology. Emphasis is placed on the analysis 
and interpretation of data resulting from the conduct of experiments with human and animal 
subjects. The book provides the technology necessary for the statical treatment of rnost sets 
of experimental data encountered in practice. 


Scnd for Copies om Approval 


McGraw-Hill Book Company, Inc. 


330 West 42nd Street New York 36, N.Y. 
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INTEGRATED SERVICES 


THE SLOW-LEARNER or the child with emotional difficulties 
needs the resources of an organization that has many different 
approaches to the problems involved. Through the Devereux 
multidiseiplined approach, the psychiatrist, the physician, the 
psychologist, the educator, and the rehabilitation specialist pool 
experiences to give each boy and girl the environment and train- 


ing best designed to meet his individual needs. Students are 


assigned to one of twenty-two semi-autonomous residential 
schools in Pennsylvania and similar groups in California and 


CLINICAL STAFF 


THE DEVEREUX FOUNDATION 


A nonprofit organization 


Devon, Pennsylwania 
Senta Barbora, California 


HELENA T. DEVEREUX 
Administrative Consultant 


EDWARD L. FRENCH, Ph.D. 
Director 


WILLIAM B. LOEB 
Treasurer 


Lance Wright, M.D. 

Elisworth Henry, 8.T.D. 
Milten Brutten, Ph.D. 
Willicm J. Cohen, Ph.D. 
Durethy E. Conrad, Ph.D. 
Sidney L.. Copel, Ed.D. 


SCHOOLS 
Founded 1912 COMMUNITIES 
CAMPS 
TRAINING 
Victoria, Texas RESEARCH 


Professional inquiries should be 
addressed to Jouw M. Barotay, 
Director of Development, of 
Cuarizs J. Registrar, 
Devereux Schools, Devon, Pa; 
western residents address Kura 
A. Sratow, Registrar, Deverwux 
Behools in California, Sauta Bar- 
bara, Calif.; southwestern resi- 
dents addrese Devereux Schools in 
Texas, Box 236, Victoria, Tex. 
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Texas. 
J. Clifford Scott, M.D. 
Edwin H. Abrabamsen, M.D. 
Aurelie Buonanno, M.D... 
Charles M. Campbell, Jr., M.D. 
Fred J. Culeman, M.D. 
Ruth E. Daffy, M.D. 
William F. Haines, M.D. Michael B. Dunn, Phd. 4 
Robert L. Hunt, M.D. Shirley M. Jahnson, Ph.D. an 
Richard H. Lambert, M.D. Joha B. Kiciser, Ph.D. Ej 
Leonardo Migran, M.D. Murray Levine, Ph.D. 
Joseph J. Peters, M.D. Henry Pintt, Ph.D. 
Alvis J. Scull, M.D. Edgar A. Smith, Ed.D. 
Jacob 8. Sherson, M.D. George Spivak, Ph.D. = 
Albert 8S. Terzian, M.D. Herbert A. Sprigie Ph.D. 
Walter M. Ubier, M.D. Anne Howe, M.S. 
Tirso L. Vinueza, M.D. Kenneth E. Evans, B.S. 
Psychoanalytic Consultants 
| G. Henry Kats, M. D.j Herbert Hi. Herskovite, M.D. 
— 


| 


